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CAATAATAAATTTTACTGTCGTTGACA 
CGTCCGTGTTGCAGTCGATCATCAGCJ 
CCTTGTTCTTGGTGTCAACGAGAAAAl 

AAGAGGCCCTATCGGAGGCACGTGAACACCTCAAAAATGG^ 

gccggtggtcatagctatggcatcgatc^ 



TGAAGATTATGAACAAAACTGGAACACTAA^ 
ATGGAGGTGCAGTCACTCGCTATGTCGACAACAAT 

AAAGATTTTCTCGCACGCGCGGGCAAGTCAATGTCC^TCTTTCCGAA^ 

gagaggtgtctactgctgccgtgaccatgagcatgaaaSgS C cttgattacatcgagtcgaa 



AACAAGGCAACCAAT^TCCCATGCAAAAGGA^ 
A^AGGGTCTTGTTTGCTT^ 

ATGCAGGTGAGGACATCCAGCTTCTTAAGGCAGCATATGAAAATT 
CCATTGTTGTCAGCAGGCATATTTGGTGCTAAACCACTTCAGTCT 
TACACAGGTTTATATTGCAGTCAATGACAAAGCTCTTTATGAGCA 
TGAAGCCTAGAGTGGAAGCACCTAAACAAGAGGAGCCACCAAACA 
TCTGTCGTACAGAAGCCTGTCGATGTGAAGCCAAAAATTAAGGCO 



taca^ggtttatSSa"™ 
tgaagcc?Igag?5gaaScS 
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CACTTGAGGAAGCTAAGACTGCTcSaAG^^ 
AATCKTAAC^AAGAGATTCTAGGAACTC^^ 

aagaaaattaatgcctatatgSS 

TTAAAATTCAAGAGGGCATCGTTGACTATGGTGTCrr 

sssgssssssssssss^ 

TAATTTGGAATCTCCAAAGCGAGTTCT^ 

gagtacaaaogaStcS^^S^ 

AAGACAACGTTCAAACCAAACACTTGGTGTTTACGTT 

TTCATTTGAAGTTCTGGCAGTAGAAGACACartA^f^^^ GAGTA CAAAGCCAGTAGATACTTCAAA 

gaattagagcttcaSIS^ 




CCTATTCTGTTGCTT<»CCAaS^^ 

cagckacaocgao^agcaaa^xagctSaSSc^^SSSS^- 
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rATGCTTAAGGCCACATTATT 
TGTCAATCCATGATGGTTACJ 5 



TGTACAATTTTT 
TATTTCTTATAG 
ACACTTACCTGG 
GAAAGGTCAGAA 
TCTATCAGGAGT 
CTGTGGGTGCTT 
GCCTACTACTTT 



rGTTCTATTTCTTATAG 
'CCTAACACTTACCTGG 

;atgcgaaaggtcagaa 
gagctctatcaggagt 

CAACCTGTGGGTGCTT 

ATGAAATTCAGACGTGTTTTTGGTGAGTACAACCATdTTGTTGCTG 



ACACTTACCTGG 
iAAAGGTCAGAA 
'CTATCAGGAGT 

TAGATGTGTCTGCTTCAGTAGTGGCTGGTGGTATTATTGCC 



kCATGCG; 



AGGGTTCTGTTAGAGTAGTAACAACTTTTGATGCTGAGTACTGTAGACATGGTACATGCGAAAGGTCA 
JIJ^r TGCCTATCTACCAGTGGTAGATG ^ TO ^^^ 



™ tcot *ggaaaagagtcatgotaatggagttacatttact^^ 

1=g££^^ 



"GG 



S^ c S A S^o AAAGTTGATACTTCTAACCCTAAG ^ 

A°™s^ CAG r cTAGc ^ 



ACTTTG 
ATTGGG 
AGAATG 
GTTAGA 



3ATCATGTTGACATA1 
3AAAGAGCTGCTGCAG 



AGATCAAGCTATTTCCATGTGGGCCTTAGTTATTTCTGTAACCTCTAACTATTCTGrTPTf^^^ar^»om^ 



S?^ A ^ AGGTAAACCATCTATCAAGGTTGCTACTG ^ 
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CACTTGTGATGGTAACACCTTCACATATGCATCTGCACTCTC 



GTCCAGTAGCACTACGACAGATGTC 
CTTGCCTACTATAACAATTCGAAGG 
ATGGGCTAGATTCCCTAAGAGTGAT 
TTACAGACACACCAAAAGGGCCTAA 



^^^"^^^^TACAGAATMTGMCTGAGTCCAGTAGCACTACGAraaTGTC 

lTGCACT 

ITCAAAT 

^r^TiEZ^^^^^^CW^^CTAAATAGAGGTAT^TOCTOGGCAGJTTAOTTC 



TTACTGTAACACCAGAAGCI 
TGCCACATTGACCATCCAA2 
TTGTGCTAATGACCCAGTGG 
ATGGCTGTAGTTGTGACCAS 



TGGGTTTTACACTTAG 



:gt 

*GA 
,TG 
l TA 

aScaS^™^ 



:acgt 

'GTGA 
JTATG 



3AAATATGATTTTACGG 

ktcccaattgtattaac 
gtaagaaaaatattogtagatggtgttccttttgttotS^^ 



cgtacataatcaggatgtaaacttacatagctcgcgtctcagtttcaaggaact^ 

A^CCAGCTATGCA^AGCTTCTGGC^^ 

^CCCGGTAATTTTAATAAAGACTTTa 
?TGAACTAAAACACTTCTTCTTTGC1 
ATCTGCCAACAATGTGTGATATCAG 
'TACGATGGTGGCTGTATTAATGCCA 
'TAATAAATGGGGTAAGGCTAGACTT 
MACTAAGCGTAATGTCATCCCTAC 
iGCTCGCACCGTAGCTGGTGTCTCTA 
lGTCAATAGCCGCCACTAGAGGAGCT 
lTGTTAAAAACTGTTTACAGTGATGT 

.gccatgcctaacatgcttaggataa 
atcacaccgtttctacaggttagct 
•cactatatgttaaaccaggtggaac 
atttgtcaagctgttacagccaatg 
tgtccgcaatctacaacacaggctc 
atgagttttacgcttacctgcgtaa 
aacagtaactatgcggctcaaggtt 

TAATGTGTTCATGTCTGAGGCAAAA 1 
GTGTAOPTr r rrvmnnn ^ * CACAGCATACAATGCTAGTTAAACA 



^Z™^^«™^?^!^?TT?^ A ^ C ^^*^^ C ^ T ^TTTTAATAAAGACTTTTATGACTTTCCTCT 

k 
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GTCTAAAGGTTTCTTTAAGGAAGGAAGTTCTGTTGAACTAAAACACTTCTTCTTTGCTrAOPA'TPr.n^oo 

TGGTGGCTGTATTAATGCC2 
AATGGGGTAAGGCTAGACT1 
AAGCGTAATGTCATCCCTAC 
CACCGTAGCTGGTGTCTCTA 
TAGCCGCCACTAGAGGAGCT 
AAAACTGTTTACAGTGATGT 
3CCTAACATGCTTAGGATAA 
kCCGTTTCTACAGGTTAGCT 
TATGTTAAACCAGGTGGAAC 
rCAAGCTGTTACAGCCAATG 
3CAATCTACAACACAGGCTC 



:tag£ 

wTCCC 
•GTCl 
AGGA 
.GTGA 
AGGA 
GTTA 
GTGG 
GCCA 



CTACTA' 
TCTATC 
AGCTAC 
ATGTAGJ 
ATAATG< 
AGCTAAt 



'AG 
'GG 
.CT< 
TC 
GT< 



^??^??^? aMAGTCC ^MTaOAGCTCGCACCGTAGCTGGTGTCTCTATCTGTAGTACTAT 

rCCACTAGAGGAGCTACTGTGGTA 
'GTTTACAGTGATGTAGAAACTCC 
.CATGCTTAGGATAATGGCCTCTC 



S?^!5??^2II? CATCA ^ TTil ^^AATAGCCGCCACTAGAGGAGCTACTCTGGTAATTG 

2TGTTT 

~~ — ~~ iw^woyu^n j» VjrV^ V«.'X"AACATG 

TS^^^^^^^^^^^^^^^^'^^^'^^-^ATCACACCGTTTCTACAGGTTAGCTAACGAGTCTCCGC 
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AGTCACATAAGCCTCCCATTAGTTTTCCATTAT 



'ACACCTTTGAAAA 
'GGTGATTACTTTG 
tCTATGTGAGAATT 

'atcaaaaggtcgg 
atcggacttgctc 
icctatgtgaaaag 

rTAGAGTGTTTTGA 
CCAGAAACAACTG 



^^S^F^E^^^^^^A^^^^A^GTAAAAATAGTAAAGTACAGATTGGAGAGTACACCTTTGAAAA 

CA 

01 

PA 

3T 

2A 

AC 

^ a ? attgtagtctotgat g^^ 



AGG ^ GA E TA ^ TCATGCTC ^GTACAGA^ACTA^ 

3ATGAGTTTTCI 
ACCTGGTACTGG 
2GGCATGCTCTC 
_ - - - ^ „ , j. ^ j. AGTAGAATC ATA 

TAAA.TTCAAAGTGAATTCAACACTAGAACAGTATGTTOTCTGCACTGTAAATGCATTGCCAGAAAC 



^ A E A T CTCACACTCT ^ TGC CACTTAGTGCACC T AC T CTAGTGCC^ 

'AG 
}TA 

GCATTAAAATATTTGCCCMAGATAAA^^ 



^CTTGTC^C^ 

AAATGTTCTACAAAGGTGTTATT 
AGAGAATTTCTTACACGCAATCC 
rGTAGCTTCAAAAATCTTAGGAT 
TCATATTCACACAAACTACTGAA 
3CAAAAATTGGCATTTTGTGCAT 

^AACTGGACTTTTTAAGGAC^^ 



^^S^^^^^^CTTCAAAATGTTCTACAAAGGTGTT^ 

AGAATTTCTTAC 
FAGCTTCAAAAfi 
ATATTCACACA2 
■ - - - - ~~ — — ^ * _3 <o o « AAAAATTGGC AT 

TTTATGACAAACTGCAATTTACAAGTCTAGAAATACCACGTCGCAATGTGGCTACATTACAAGCAGAAAAr 



AAAATCTTj 

. " " - - *w x ,l %or x iAli ViBAj AC AAACTA< 



3gctgtcatgc 
:ttagtagctg 
:tccaccaggt 
vttaagatagt 

5CATGGCTTTG 
^CAAACGTGCA 
PATGTCTATAA 
rCAACATTGCC 
:CCATGAGTGC 
iATTCTGCTTG 
7CATGACATTG 

:tcagccatgt 

?TCACTGATGG 
JTTTGACACAA 
wTGCATTCCAC. 

'ctgatagtcc 
'acgtgtatta 
'ggatgcatat. 
:tgtggaatac 
gatggacacg( 

.TGTGGAGATC 
TTAAACCAGT< 
GACTACAAAA< 
ACCTACTGAGi 
TTAGAAACGC( 
GCACAAGCTAC 
AGTAGACGGCi 

AACTGACTTTCTCGAGCTCGCTATGGATGAATTCATACAGCC^TATAA 



A ^^ GAA 2 C T ATTCGTCACGTT CGTGCG^^^ 

•GTTA2 

:aaaac 

JTGCG1 
TGGGC 
•GTGTG 
'TTGAC 
CATGA 
AGCAG 
GGGTT 
GTTCT 
CGATG 
ATAAA 
TGTAG 
TAAGC 
ACTAT 
TCTGC 
3TACT 
ATAAG 

TATTG 
3TAAG 
ATCTG 

:aaga 

^CCTT» 
3GACC 

^^^^S C ? AC ^ 



:accag 

AGAT2 
'GGCTl 
ACGTG 
rTCTAI 
.CATTG 
.TGAG1 
CTGCT 
GACAT 
GCCAT 
CTGAT 
GACAC 
ATTCC 
ATAGT 
TGTAT 
TGCAT. 
SGAAT. 
3GACA 
3GAGA 
AACCA< 
TACAA 
TACTGJ 
3AAAC< 
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taat^gtcacaagatttgtcagtgatttcaaaagtggtcaaggttac^ttgaSa 

ACCAGGTGTTGCGATGCCTAACTTGTACAAGATGCAAAGAATGCTTCTTGAAAAGTGTGACCTTPan^S^ 



GGGTGGTTCTS 

^™ TGCATCATCATC ^ G CATTTTTAATTGG^^ 



GTTTTTCACTTATCTGTGTGGATTTATAAAGCAAAAACTAGCCCTGGGTGGTTCTATAGCTGTAAArAT-^A 
CAGAGCA ™ C * TGGAATGCT ^^ 



J GA ^ CTATACCATCCATGCTAACTACATTOT CTGGAGGAA^ 

:tgtcatacctttta 
'ttggttctaccatg 

lGCATGTAACTTTGA 

ATGCATTTAATTGCACTTTCGAGTACATATCTGATGCCT 



i TTCT 

'TTTA 
ICATG 
'TTGA 
!GATA 



AG ™™™ ATOG ^ 

aacaacaagtcacagtcggtgattattattaacaattctactaatgttgttatacgarS??; 0 ^ 

ATTGTGTGACAACCCTTTCrrra^ 



^ A ? A ^ G r cGTGATcTAccTTcTGGTTTTAAcAcTTTC 



LGATTGACA 
lATATTACA 



AGATGCTGTTGATTGTTCTCAAAATCCACTTGCTGAACTCAAATGCTCTGTTAAGAGOTTTGACATTrAra 

aaggaatttaccagacctctaatttcagggttgttccctcaggagatgSgt^ 

AA ™ G r CCTTTOGGAGAGGTTTTTAATGCTACT ^^ 



ATGA 
'TATA 
CCCT 
GCAC 



?™^ AAGACAA ^^ 

'TATA 

:ccct 

T^^^ TACCAACCTTACAGAG ^^ 



TTTCATGGGTTGTGTCCTTGCTTCGAATACTAGGAACATTGATGCTACTTCAACTrCTAATTAT'AanirpjvrTiTv 

aatataggtatcttagacatggcaagcttaggccctttgagagIScSS^ 
gatggcaaaccttgcaccccacctgctcttaattgttattggccStS 



GC TTTTGC AC ACAACTAAAT 



X3TG< 
l GCT< 
'TGC 

^ GCA ^ gag . ttaccca ^ tg ^tctatgagaacca^caaatcg^ 



C ^ GA ^ ACA ^ G T GCT ^^^ 



GATTAGTCAAATTCAAGAATCACTTACAACAACATCAACTGCATO 



cc 
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CAGGG 
CAAAA 
GTCTT 
AGGCA 
ACTTC 
ATCAT 
CTTCA 
TTCAA 
TTGGG 
CATCG 
GTGGT 
TAAAC 
GACAA 
ATGGC 
3ATGG 
rCACA 
^CAATV 
TACTT 

nnv,A\3iGT C AGi 

CA ? T *CAACACCAAAACTC^ 



AAGAGTTGACTTTTGTGGAA 
TCCTACATGTCACGTATGTG 
AAAGCATACTTCCCTCGTGA 
CTTTTCTCCACAAATAATTA 
TTAACAACACAGTTTATGAT 
AAAAATCATACATCACCAGA 
AAAAGAAATTGACCGCCTCA 
GAAAATATGAGCAATATATT. 
GTCATGGTTACAATCTTGCT 
TTCTTGCTGCAAGTTTGATG 
CGAACTTATGGATTTGTTTA' 
ATGCTTCTCCTGCAAGTACT< 
"TTGTTATTGGCGTTGCATT' 
3CAGCTAGCCCTTTATAAGG< 
kTCTTTTGCTTGTCGCTGCA< 



JGTATGT 
!CTCGTG. 
ATAATT. 
TTATGA 
CACCAG 
CGCCTC 
ATATAT' 
TCTTGC 
TTTGAT< 
TTGTTT, 
AAGTAC 
TTGCAT f 



E^~T?^S??^9^ a ^* a< ^*^^^^ T( ^ T GTTCTCTTCCrACATGTCACGTATGT<3CCATCCCAGGA 

lGCATACTTCCCTC 

'TTCTCCACAAATA 

ACAACACAGTTTA 

AATCATACATCAC 

AGAAATTGACCGC 

AATATGAGCAATA 

ATGGTTACAATCT 

TTGCTGCAAGTTT* 

ACTTATGGATTTG 

CTTCTCCTGCAAG' 

GTTATTGGCGTTG 



CTA 
CCT 
TGT 
ATG 
AAA 
TTG 
AGG. 
VGA 
3TT 1 



lCAA 
ATC 
lGAA 
ATA 
>TGG 
'TGC 
CTT. 



CAGATG 
CTCAA1 
TATTAA 
TGCTTT 

GATGAG 



jaccgcctc 
5caatatat' 
:aatcttgc 



Ti 



ACCGAATGTGCAAAT 
ATGAGCCGACGACGA 
GTTTCGGAAGAAACA 
AGTCACACTAGCCAT 



CAATTTA 
TACTCAT 



g ^S agcgtgcctttgt ^cacaagaaagtgagtacgaacttatgtactc^^^ 

^ C ™^ AG ™ AATAGCGTA ™^^ 



?TGCTCGTAC 
^CAATTGTGA 
ATGGCCGGA 
rAACGCTTTC 

:gctaccgta 

iCAGTAAGTG 
.TGAGGACTT 
'CCTCTAACT 



ATCATTCGTGGTCACTTGCGAATGGCCGGA 
GATCACTGTGGCTACATCACGAACGCTTTC 
CAGGTTTTGCTGCATACAACCGCTACCGTA 
GACAATATTGCTTTGCTAGTACAGTAAGTG 



IATCACGAACGCTTTC 
'ACAACCGCTACCGTA 

i CAA ^ TGTTTCAi "^^ 



AACTTTTCATCAGACAAGAGGAGGTTCAACAA 
GTATTTTTAATACTTTGCTTCACCATTAAGAG 
rTGTGCTTTTTAGCCTTTCTGCTATTCCTTGT 1 
CCAGGATCTAGAAGAACCTTGTACCAAAGTCT. 
-TCTATGCAGTTGCATATGCACTGTAGTACAG 
rGTAAGGTACAACACTAGGGGTAATACTTATAi 



rTTAGCCTTTCTGCTATTCCTTGT 
TAGAAGAACCTTGTACCAAAGTCT 



GTCTAAACGA 



CTTQGTTCACAGCTCTCACTCAGCATGGCAAQGAGGAACTTMATTCCCT^SAGGCCAGGQCQTTCCAATC 



CCAATAATACTGCGT 



FIGURE 3G 



WO 2004/096842 A 

PCT/CA2004/000626 

13/55 



'GGTAAAGGCCAACA 
fCCAAAAACGTACTG 
'AAGGAAATTTCGGG 
.TTTGCTCCAAGTGC 
GCTGACTTATCATG 
ACAAGCACATTGAC 
GAAGCTCAGCCTTT 
TGATTTCTCCAGAC 



JGTACTG 
l TTCGGG 
AAGTGC 
ATCATG 
ATTGAC 
GCCTTT 

AACTTCAAAATTCCATGAGTGGAGCTTCTCCTGATTCAA 



ii 



AATTGCACAAT 1 



GenBank Accession No. AY274119.1; SEQ ID NO: 1 
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AG^CGAGT^CTCGTCCCT^^ 

GAGCCTTGTTCTTGGTGTCAACGAGAAAAC 
TGCTAGTGCGTGGCTTCGGGGACTCTGTGG 



^*^^^^^^"^*^'^®^'^'^ACCGAAAGGTAAGATGGAGAGCCTTGTTCTTGGT(3'PrftftrranB „ „ > - 



* ^-GCCACATGTGGGC GAAACC CCAATTGCATACCGCAATGTTCTTCTTCGTAAGAACGf5TA in-ia^ 



HSCAGATTATCACAACCACT 



AATGCCATGTCCTGCCTGTCAAGACCCAGAGATTGGACCTGAGCATAGTGTTGCAGATTATCArAArr-aprp 

caaacattgaaactcgactccgcaagggaggtaggactagatgttt?SaS 



tGTCCTGCGGTAACTA 
'CAGTTTTAACACCAC 



TGTGTGGTTTTCCCTCACAGGCTGCTGGTGTTATCAGATCAATTTTTGCGCr 



J™ ACGCCATGGTTTATACTTCAGACCTGCT ^ CC ^^ 



™™^ GGA ™ GAGG ^ AA ^ C ?T AGTGC 

kTCAAGGATTGTG 
'ATCGCTGGCGCA 



======== SS 

AAGTTGCGATCACTCAACTTAGGTGAAGTCTTCATCGCTCAAAGCAAGGGACTii; 



taaggacaaagaacaatactgcgcattgtctcctggtttactggctIc^ 

GG^GTGCACCAATTAAAGGTGTAACCTTTGGAGAAGATACTGTTTGGGA^GTTCAA 



AGAATCACATTTGAGCTTGATGAACGTGTTGACAAAGTGCTTAATGAAA^GTCCTCTGTC 

Sr^™ GGAAGAAGAAAT ^ 



JTT 
JTG 



? a ^ ag ^ gagcaatcag agattgagccagaaccag^ 



™™ GAAA 5 rAAGTTO 

ctcagaacatgcttac^ggtgaagatatgtctttccttgagaag^tgcac^ 



TACCACAACACT 

ATT 
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^ff^AAGAGATTC^^^ 

AAGAAAATTAATGCCTATATGCATGGATGTTAGAGCCATAATGGCAACCA^rcC^ACGTAAGTA^a^ao^^a 



ICTTAAAGCTCCTGCC 
- CGTCATCAAAGACATC 

ACA^GTTTCTTTGGCTGGCTCTTACAGAGATTGGTCCTATTCAGGACAGCGTACAGAGTTAGGTGTTGAATT 



TTTTAATCTTGAAGAGGCTGCGCGCTGTATGCGTTCTCTTAAAGCTCCTGCCGTAGTGTCAGTATCATCAC 



======== 

AGGTACATGTCTGCTTTAAACCACACAAAGA^ 

ATGGGCTGATAACAATTGTTATTTGTCTAGTGTTTTATTAGCACTTCAACA^ 



GCTTACAGTAATAAAACTGTTGGCGAGCTTGGTGATGTCAGAGAAACTATGACCCATCTTCT 



iTCA 
'GTC 
ATG 

^SSSI^ TAACACAAAATTTGCTGAT ™^ 



:aaccatcaagcctgtgtc 

CTTACTATACAGAGCAGCCTATAGAC^^^ 



S G ^ ACAA ^ GGACCAGTGACTGATCTTTTCTACAAGG ™ 
GTATAAACTCGATGGAGTTACTTACACAGAGATTGAACCAAAATTGGATGGGTi 



J C ^™r TCTCTCACATTCTTCCCAGACTTGAATGGC ^^ 
CAGCGAGTTTCAAGAAAGGTGCTAAATTACTGCATAAGCCAATTGTTTGGCACATTAAcS^^ 



jTAAAATTTTGGCTTAT 
"TAGCACAACGTGTGTT 
\AAAAGTACCAATTCTA 

:taaattatgtttggat 



lCGTGTGTT 
ICAATTCTA 

<^CGGCATTAATTATGTGAAG^ 



GAATTAGAGCTTCACTACCTACAACTATTGCTAAJ^TAGTCT^S 



TTGAAACCAT 
GTTTTGGCAT 



GGTTCTTTTCCTTGCAGCATTTGTTTAAGTGGATTAGACTCCCTTGATTCTTATCCAGCTCTTGAAAPPaT 

^ A Sf GACGATOTCATCGTACAAGCTAGACT ^ 

ATATGTTGTTCACAAAATTCTTTTATTTATTAGGTCTTTCAGCTATAATGCAGCT 



* G £ AA ^? TAGGATO ^^ 

aaajuvtggcgcgcttcacctctactttgacaaggctcgtcaaaagacctatcaSS 

TTTTGTCAATTTAGACAATTTGAGAGCTAACAACACTAAAGGTTCACTCCCTATTAATGTC 

ATGGCAAGTCCAAATGCGACGAGTCTGCTTCTAAGTCTGCTTCTGTGTACTACAGT^AG 

CCTATTCTGTTGCTTGACCAAGCTCTTGTATCAGACGTTGGAGATACTACT 

TGATGCTTATGTCGACACCTTTTCAGCAACTTTTAGTGTTCCTATGGAAAAACT?^ 
CAGCTCACAGCGAGTTAGCAAAGGGTGTAGCTTTAGATGGTGTCC^ 
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:gtagtgctgc 
:ataactacta 



o^^ ATCTGG ^ TOT ^ GACTACATGTCTTTAT CTGAACAGCTGCGT^^ 



iTGTACAATTTTT 

AGGGTTCTGTTAGAGTAGTAACAACTTTTGATGCTGAG — - _ X t- ATACAGTTTCCTAACACTTACCTGG 



TACTGTAGACATGGTACATGCGAAAGGTCAGAA 



G I AG ^™? G ^ CTACCAGTGGTA ^GG^ 

kACATCTTTACTCd 
rATTGCCATATTGGI 
ATGTTGTTGCTGCT-P 

TTTCACTATACTCTGTCTGGTACCAGCTTACAGCTTTCTGCCGGGAGTCTACTCAGTCTTTTACTTGTACT 



TOTCTGTGGTGTTGATGCGATGAATCTCATAGCTAACATCTTTACTCCTCTTGTGC^ 

tac^tgtgtctgcttcagtagtggctggtggtattattgccatattggtgS^^ 

ATGAAATTCAGACGTGTTTTTC^TGAGTACAACCATGTTGTTG™ 



'TTAACAA 
ITGTACCT 
iTAACAGG 
rAAGCAGC 
.CCACCAC 
AGTTGAA 
TATACTG 
CGCAAAT 

^^^^^^^^^^^^^^^^ 



ITTTGTGTACCT 

:agtataacagg 
'cgtgaagcagc 
.ccaaccaccac 
k5caaagttgaa 

AC AGX AT AC T G 



!AGG 
AGC 
'CAC 
GAA 

TCCAAGACATGTCATTTGCAC^ 



!^TC^CAAGG^ 

'ACT, 

GGGTGCATGGTACAAGTAACCTGTGGAACTACAACTCTO 



^ G r GCTCTATATAACAAGTACAAGTATTTCA GTGGAGCCT^ 
^ C ^S A ^ AGCAAAGGCTCTAAATGAC ™ GC ^ 



r GAAACA ™ CAG ^ CTAGCATGCTA C^^ 

mfgtggtagtgttgg 
-aacaggagtacacg 

^ggcatggctgtatgctg^ 



p^p A p AGCATTAAAGGTTCTTTC ^ 



ACTTTGGCCTTTTCTG 
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CTCTGCGGCITCTACCACACAAACjS^ 
GAGGTA<K TO TGra^cI™SAraS^ 



tttgcagtagaccctgctaaagoSatm^ 
gaagatgttgtgtacacacactgg^ta^a^gacaggcaattactgt 



AGTGGGTTTTACACTTAG 
ACCAACTCCGCGAACCCT 

C^GGCACAGGCACTAGTACTGATGTCGTCTACAGG^ 



CAGCGTCTAACTAAATACACAATGGCTGATTTAGTCTATGCTCTACG' 



GT 



GCAAAGTTCCTAAAAACTAA^GCTGTC^^ 
CTTTGTAGTTAAGAGGCATACTATGTCTAACTACCAACATPA 

^ACGTCATTTTGATGAGGGTAATTGTG 
rATTATTTCAATAAGAAGGATTGGTAT 
lGGTGAGCGTGTACGCCAATCATTATT 
'AGGCGTACTGACATTAGATAATCAGG 
'CACCAGGCTGCGGAGTTCCTATTGTG 

GCTGATCTCGCAAA^CCAC^^^^ 

cttcgaccgttattttaaatattgggaccagaca^ 



;cci 

'TA< 

attcatattactcattgctgatgcccatcctcactttg 



GTATCCTTCATTGTGCAAACTTTAATGTCTTATTTTCTACTGTGTTTC^^^ 
GTAAGAAAAATATTTGTAGATGGTGTTCCTTTTGT^ 
CGTACATAATCAGGATGTAAACtS^ 

T 

G 

C 

a 

3 

r 

\< 
-I 

:< 



ATTTTAATAAAGA 
AAACACTTCTTCT 

:aacaatgtgtgat. 
1gtggctgtattaa 
.tggggtaaggcta 
gcgtaatgtcato 

CCGTAGCTGGTGTi 
GCCGCCACTAGAG< 
AACTGTTTACAGT< 
CTAACATGCTTAG( 
CGTTTCTACAGGT' 
TGTTAAACCAGGTC 
AAGCTGTTACAGCC 
AATCTACAACACAC 
TTACGCTTACCTGC 
ACTATGCGGCTCA2 
TTCATGTCTGAGGC 
TACAATGCTAGTTJ 
-AGGCTGTTTTGTC 
^TTGATGCTTACCC 
-ATTAGAAAGTTAC 

TGAGGCTATGTACACACCACATACAGTC1TX3CAGGCWOTAGQTGCTTGTGTATTO 



ACGATGGTGGCT' 

AATAAATGGGGT, 

TACTAAGCGTAA 

CTCGCAC CGTAG* 

TCAATAGCCGCC 

GTTAAAAACTGT' 

CCATGCCTAACA* 

TCACACCGTTTC 

ACTATATGTTAAi 

TTTGTC AAGC TG r . 

3TCCGCAATCTA< 

TGAGTTTTACGC 1 : 

&CAGTAACTATG< 

^ATGTGTTCATG 1 : 
— — — - — ^ *j * j. 4. j. a. x ^ACAGCATACAATY 

GTGTACCTCCCTTACCCAGATCCATCAAGAATATTAGGCGCAGGC^TTTTG^^ 



TGCCAACCA? 
GACTTTATT-? 
CCTACTATA2 
CTCTATCTGT 
GAGCTACTGl 
GATGTAGAAS 
GATAATGGCC 
TAGCTAACGA 
3GAACATCAT 
CAATGTAAAT 
3GCTCTATGA 
-GTAAACATT 



S3^T^^^^???^^^^ AC ^^ ATTC ^ AC ^ T G GT( 3GCTGTAT u rAATGCCAACCAAGrAATCGT 

Gra 

AA1 
AGC 
CCA 
GTT 
CAT 
TCT 
AAA 
PGT 
TAC 
3CT 

rt^^^^^E^— ^^??? AG ^^^^^^^^^^^^^''3TCTTCATGTCTGAGGCAAAATGTTGGACTGAGA 



GTCTCTATCTGTAGT 
AGGAGCTACTGTGGT 
GTGATGTAGAAACTC 
AGGATAATGGCCTCT 
GTTAGCTAACGAGTG 
3TGGAACATCATCCG 
3CCAATGTAAATGCA 



lGCCGCCACTAGAG 

aactgtttacagt 
:ctaacatgcttag 
icgtttctacaggt 
tgttaaaccaggt 



EZ?^^T!^?^*'^*^^^^'--'^''agccgccactagaggagctactotggtaattg 

GTT 
CAT 
TCT. 
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JCGATAC^AACATGTGATTGGACT^^ 

GCGAAGTACTCTCTGACAGAGAATTGCATC^ 
^S A ™^ TACTCGTTACCGTCTAACTAAAAA 



AGTC 
^GCT 



TGAAAAG 



S CACAGAAGGATAAGTCAGCTCAATGCTTCA ^^ 

TGTCAACCGCTTCAATGTGGCTATCACAAGGGCAAAAATTGGCATTTTGTCCA^ 

tttatgacaaactgcaatttac^gtctagaaataccacgtcgcaatctSc^^ 
gtaactggactttttaagc^ctgtagtaagat^^ 

CAG S G T TCATATAAAGTOCAAGACTGAAGGATT ^^ 
A S^ A 2 AC * CATOT ^ 

acccgcgaagaagctattcgtcacgttcgtgcgtggattggctttgatgtagagggctg^ 
^atgctgtgc^tactaacctacctctccag^ 

™I A ^7 GACACTGAAAATAACACAG ^^ 



TATGATTGATGTTCAGCAGTGGGGCTTTACGGGTAACCTTCAGAGTAACCATGACCAACATTRnrAPP^zvo 

atggaaat^acatgtggctagttgtgatgctatcatgactagatgottagcaStc 



atggagtcacattaattggagaatcagtaaaaacacagtttaactactttSga^ota^ 
c ^ ca gttgcctgaaacctactt^ 

"ktoactttc^^ 
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'GAACTCAAATGCTC 
'TCCCTCAGGAGATG 
wCTAAATTCCCTTCI 
!TACAACTCAACATT 
'CTCCAATGTCTATG 
GTGTTATTGCTGAT 
AACATTGATGCTAC 
CTTTGAGAGAGACA 
GTTATTGGCCATTA 
GTACTTTCTTTTGA 
GAACCAGTGTGTCA 
TTCAACCATTTCAA 
GAAATATTAGACAT 
TGAAGTTGCTGTTC 
CACCAGCTTGGCGC 
GAGCATGTCGACAC 



TTCTGTCTATGCATGGGAGA 
CATTTTTTTCAACCTTTAAG 
TATGCAGATTCTTTTGTAGT 
TGATTATAATTATAAATTGC 
CTACTTCAACTGGTAATTAT 
GACATATCTAATGTGCCTTT 
ATTAAATGATTATGGTTTTT. 
TTGAACTTTTAAATGCACCG 
STCAATTTTAATTTTAATGG 
rCAACAATTTGGCCGTGATG' 
kCATTTCACCTTGCGCTTTT< 
3TTCTATATCAAGATGTTAAI 



TTAS 
GTAG 
ATTG 
ATT2 
CCTT 
TTT1 
CACC 
AATG 
TGAT 



TTGCTGATTATAATTATAAATTG 
GATGCTACTTCAACTGGTAATTA 
3AGAGACATATCTAATGTGCCTT 
3GCCATTAAATGATTATGGTTTT 
TCTTTTGAACTTTTAAATGCACC 
3TGTGTCAATTTTAATTTTAATG 



i 

* 

^!^^?™ CTCCTT ^TCAAAGAGA TO TCAACCATTTCA^ 



T?~5^??????^?^^^^^^^^AA^ACTAGGAACATTGATGCTACTTCAACTGGTAATTATAATTATA 

GAGi 

CCA' 

^^^?^^5^SS^^^??^^^^^^^^^^^^'^AA^AACCAGTGTGTCAATTTTAATTTTAATGGACTCACT 



CAACTAAGAGGTCTTTTATTGA^ 

■ I 
I 
i 

GATTAGTCAAATTCAAGAATCACTTACAACAACATCAACTGCATTGGGCAAGCTGCAAGACGTTGTTAACC 



^^^^mm^S^^^H^^T^^^^^^^^^^^^^A^A^ATCTCATTTGTGCGCAGAAGTTCAATGGACTTAC 

:tgct 

^^gag^ac^^ 



======= 
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5GCTACCAC 
^TCCCAGGA 
JTGTTTTTG 
tCAGACAAT 
'CTGCAACC 
TGATCTTG 
fAGGTCGCT 
/TGGCCTTG 
fTTGCATGA 
ATGACTCT 
GATTTTTT 
C ATGC TAC 



fCATACTTCCCTCGTGAAGGTGTTTTTG 
"TCTCCACAAATAATTACTACAGACAAT 
CAACACAGTTTATGATCCTCTGCAACC 
ATCATACATCACCAGATGTTGATCTTG 
.GAAATTGACCGCCTCAATGAGGTCGCT 
ATATGAGCAATATATTAAATGGCCTTG 
TGGTTACAATCTTGCTTTGTTGCATGA 
TGCTGCAAGTTTGATGAGGATGACTCT 



a ™ T5^??^ ATTACA ^ GAGG ^CTTCTTTTCTCCACAAATAATTACTACAGACAAT 

AACACAGTTTATGATCCTCTGCAACC 
TCATACATCACCAGATGTTGATCTTG 
AAATTGACCGCCTCAATGAGGTCGCT 
TATGAGCAATATATTAAATGGCCTTG 



•CACCAGATGTTGATCTTG 
ICGCCTCAATGAGGTCGCT 

GTATGTTTGGCTCGGCTTCATOGCTCGACTAATTGCCA— 



AAAAATTTAAATGAATCACTCaS^^^ 



TCTACAATGCATCAACGCA1 
CATTACTTTATGATGCCAAC 
AGTGTCACAGATACAATTGT 
AATTGGTGGTTATTCTGAGG 
AAGTTTACTACCAGCTTGAG 

acacacaatcgacggctcttcaggagttgctaa^cc^aIS 



lG( 



l TACTGAAGGT( 



SSSSSSSSSSSSSSSSSSS^ 



GG ! ACG !^ AG ™ AGCGT ^ 

TTAACGTGAGTTTAGTAAAACCAACGGTT1 
GTTCCTGATCTTCTGGTCTAAACGAACTAA 
ATGGCAGACAACGGTACTATTACCGTTGAG 
TTTCCTATTCCTAGCCTGGATTATGTTACT 
TAAAGCTTGTTTTCCTCTGGCTCTTGTGGC 
ATTAATTGGGTGACTGGCGGGATTGCGATT 
CGTTGCTTCCTTCAGGCTGTTTGCTCGTAC 
TCAATGTGCCTCTCCGGGGGACAATTGTGA 
^TCATTCGTGGTCACTTGCGAATGGCCGGA 
3ATCACTGTGGCTACATCACGAACGCTTTC 
2AGGTTTTGCTGCATACAACCGCTACCGTA 
3ACAATATTGCTTTGCTAGTACAGTAAGTG 
-AGAGATATTGATTATCATTATGAGGACTT 
VTAGTGAGACAATTATTTAAGCCTCTAACT 
3TTAGATTATCCATAAAACGAACATGAAAA 

:tatatcactatcaggagtgtgttagaggt 
-gagggcaattcaccatttcaccctcttgc 
:ttttgcttgtgctgacggtactcgacata 

AGAATGAATGAGCTCACTTTAATTGACTTCTAT^^ 



'tctggtctaaacga 

lCGGTACTATTACCG 

:tagcctggattatg 
"ttcctctggctctt 
'gactggcgggattg 
"tcaggctgtttgct 
ctccgggggacaat 
tcacttgcgaatgg 
ctacatcacgaacg 
gcatacaaccgcta 
tttgctagtacagt. 
attatcattatgag* 
attatttaagcctc 

CATAAAACGAACATi 
TCAGGAGTGTGTTA< 
CACCATTTCACCCT( 



™?^5^^E^ G ^ TT ^^ AA ^ TI -^ MCTCT TCTCAAGGAGTTCCTGATCTTCTGGTCTAAACGAACTAA 

CATGGCAGACAACGGT 

GTTTCCTATTCCTAGC 

ATAAAGCTTGTTTTCC 

AATTAATTGGGTGACT 

TCGTTGCTTCCTTCAG 

CTCAATGTGCCTCTCC 

GATCATTCGTGGTCAC 

AGATCACTGTGGCTAC 

TCAGGTTTTGCTGCAT. 

CGACAATATTGCTTTGi 

GCAGAGATATTGATTA 

AATAGTGAGACAATTA' 

AGTTAGATTATCCATA 

3CTATATCACTATCAG< 

TGACAATAAATTTGCACTAA^^ 



IATTATGTTACT 
K3CTCTTGTGGC 
K3GATTGCGATT 
^TTTGCTCGTAC 
GACAATTGTGA 
GAATGGCCGGA 
CGAACGCTTTC 
CCGCTACCGTA 
TACAGTAAGTG 
TATGAGGACTT 
AGCCTCTAACT 
GAACATGAAAA 



lCAGAATTAATTGGGTGACTGGCGGGATTGCGATT 
'ACTTCGTTGCTTCCTTCAGGCTGTTTGCTCGTA 
'CTTCTCAATGTGCCTCTCCGGGGGACAATTGTG. 

■tgtgatcattcgtggtcacttgcgaatggccgg. 

AAGAGATCACTGTGGCTACATCACGAACGCTTT< 
GATTCAGGTTTTGCTGCATACAACCGCTACCGT. 
CAACGACAATATTGCTTTGCTAGTACAGTAAGT* 
ATAGCAGAGATATTGATTATCATTATGAGGACT' 
TTCAATAGTGAGACAATTATTTAAGCCTCTAAC' 

TTATTCTCTTCCTGAC^^ 



CAGTAACACTTGCTTGTT^CTTGCTGCTGic'S 

GT1 
CAI 
TCI 

hGC 

AAGAAGAATTATTCGGAGTTAGATGATGAAGAACCTA^" ~ ^^^^^^^^^^^^^^^^CCTCTAACT 



lTTGTGA 
fGCCGGA 
'GCTTTC 
ACCGTA 
TAAGTG 

tcaggattgctatttggaatcttgacgttat^ 



'CACTTGCGAATGGCCGGA 

:tacatcacgaacgctttc 

rC AT ACAACCGCT ACCGTA 



aatgcttattatattttgg^ttca^^^ 

lTGAAACTTCTCATTGTTTTGACTTGTATTTCTC 
tfCTAATAAACCTCATGTGCTTGAAGATCCTTGT. 

•ggctttgtgctctaggaaaggttttaccttttc 
•gttactatcaactgtcaagatccagctggtggt! 

.CCAAACTGCTGCATTTAGAGACGTACTTGTTGT' 

.ccccaatcaaaccaacgtagtgccccccgcatt; 
gaatggaggacgcaatggggcaaggccaaaacac 
ggttcacagctctcactcagcatggcaaggagg^^ag^ 



^ggatctagaagaaccttgtaccaaagtctaaacg. 
'gcagttgcatatgcactgtag 

jgtacaacactaggggtaatac 

.gatggcacactatggttcaaa 

^cttatagctaggtgttggtac 

•aaataaacgaacaaattaaaa 



gtaatactto 
gttcaaaca1 
ttggtaccti 



'AAA 



cttogctttgtgctctaggaaaggt™^^ 

!^™S? C ?? ATAGCTAGGTGTTGGT ACCTTCATGAAGG 



T 
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CCMAMACAGTACMCGTCACTCASGCAWTG^^ 

gaccaagacctaatcagacaaggaactgattacaaa^^ 

AACTTCAAAATTCCATGAGTGGAGCTTCTGCTGATTCAACTCAGGCATAAAcS^ 
GGCAGATGGGCTATGTAAACGTTTTCGCAATTCCG 

ACATTAGGGAGGACTTGAAAGAGCCACCACATTTTCATCGAGGCGACGCGGAGTACGATCGAGGGTAPAPT 

CCATGTGATTTTAATAGCTTCTTAGGAGAATGACAAAAAAAAAAAAAAAAAAAAAAAA 
GenBank Accession No. AY274119 . 2 . ; SEQ ID NO: 2 



FIGURE 3P 



WO 2004/096842 PCT/CA2004/000626 

22/55 



ERV-2 

TOR2 

AIBV 



AIBV 



ERV-2 

TOR2 

AIBV 



ACACTCATCATGACCACACAAGGCAGATGGGCTATGTAAACGTT^ 



ERV-2 

TOR2 CGAT. 
AIBV 



ACATAGTCTACTCTTGTGCAGAATGAATTCTCGTAACTAAACAGCACAAGTAGGTT 



ERV-2 

TAGTTTAGTTTAAGTTAGTTTAG 
* * ** * 

TOR2 2 ^£^™^ C TCGCCGAGGCCACGCCGAGTAGGACCGAGGGTACAGC 

m^^^^^^ '^^'C^G'GCCACGCGGAGTACGATCGAGGGTACAGT- 

AGTAGGTATAAAGATGCCAGTGCCGGGGCCACGCGGAGTACGATCGAGGGTACAGCACTA 

* ** ******** ***** ** *********** 



~^^^T^F^^ '^^^^^^^~^^^'^^^^'3^^CG'J!QGGC'ITTCT — TTTGGTTTA 

;^^™ AMGAGAGCTCCCTATATCG ^ gagccct ^tgtaaaa™™a 

GGACGCCCATTAGGGGAAGA-GCTAAATTTTAGTTTAAGTTAAGTTTAA TTGGCTAA 



** * ** . , * 



ERV-2 CTTCTTC ---- 

GTATAGTTAAAATTTATAGGCTAGTATAGAGTTAGAGCA GeSanS : J W451 (SEQ ID NO: 32, 
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MFIFLLFLTLTSGSDLDRCTTFDDVQAPNYTQHTSSMRGVYYPDEIFRSD 

TLYLTQDLFLPFYSNVTGFHTINHTFGNPVIPFKDGIYFAATEKSNWRG 

WVFGSTMNNKSQSVIIINNSTNWIRACNFELCDNPFFAVSKPMGTQTHT 

MIFDNAFNCTFEYISDAFSLDVSEKSGNFKHLREFVFKNKDGFLYVYKGY 

QPIDWRDLPSGFNTLKPIFKLPLGINITNFRAILTAFSPAQDIWGTSAA 

AYFVGYLKPTTFMLKYDENGTITDAVDCSQNPLAELKCSVKSFEIDKGIY 
QT SNFRWP SGDWRF PNI TNLC PFGEVFNATKF PS VYAWERKKI SNCVA 

DYSVLYNSTFFSTFKCYGVSATKLNDLCFSNVYADSFWKGDDVRQIAPG 

QTGVIADYNYKLPDDFMGCVLAWNTRNIDATSTGNYNYKYRYLRHGKLRP 

FERDISNVPFSPDGKPCTPPALNCYWPLNDYGFYTTTGIGYQPYRVWLS 

FELLNAPATVCGPKLSTDLIKNQCVNFNFNGLTGTGVLTPSSKRFQPFQQ 

FGRDVSDFTDSVRDPKTSEILDISPCAFGGVSVITPGTNASSEVAVLYQD 

WCTDVSTAIHADQLTPAWRIYSTGNNVFQTQAGCLIGAEHVDTSYECDI 

PIGAGICASYHTVSLLRSTSQKSIVAYTMSLGADSSIAYSNNTIAIPTNF 

SI S ITTEVMPVSMAKTSVDCNMYICGDSTECANLLLQYGSFCTQLNRALS 

GIAAEQDRNTREVFAQVKQMYKTPTLKYFGGFNFSQILPDPLKPTKRSFI 

EDLLFNKVTLADAGFMKQYGECLGDINARDLICAQKFNGLTVLPPLLTDD 

MIAAYTAALVSGTATAGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYE 

NQKQIANQFNKAISQIQESLTTTSTALGKLQDWNQNAQALNTLVKQLSS 

NFGAISSVLNDILSRLDKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEI 

RAS ANL AATKMS ECVLGQ SKRVDFCGKGYHLMS F PQAAPHGWFLHVTYV 

PS QERNFTT AP AI CHEGKAYF PREGVFVFNGT SWF I TQRNFF S PQ 1 1 TTD 

NTFVSGNCDWIGIINNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGD 

ISGINASWNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYVWL 

GFIAGLIAIVMVTILLCCMTSCCSCLKGACSCGSCCKFDEDDSEPVLKGV 
KLHYT (SEQ ID NO: 33) 



Figures 



MADNGTITVEELKQLLEQWNLVIGFLFLAWIMLLQFAYSNRNRFLYIIKL 

VFLiWLLWPVTLACFVLAAVYRINWVTGGIAIAMACIVGLMWLSYFVASFR 

LFARTRSMWSFNPETNILLNVPLRGTIVTRPLMESELVIGAVIIRGHLRM 

AGHSLGRCDIKDLPKEITVATSRTLSYYKLGASQRVGTDSGFAAYNRYRI 
GNYKLNTDHAGSNDNIALLV (SEQ ID NO: 34) 



Figure 6 
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MYSPVSEETGTLIVNSVLLFLAFWFLLVTLAILTALRLCAYCCNIVNVS 
LVKPTVYVYSRVKNLNSSEGVPDLLV (SEQ ID NO: 35) 

Figure 7 



MDDFSRQLQNSMSGASADSTQA (SEQ ID NO: 36) 

Figure 8 
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BoCov 

OC43 

PHEV 

FCV 

TGEV 

T0R2J* 

ORF5 

AIBV2 

AIBV 



BoCov 

OC43 

PHEV 

FCV 

TGEV 

TOR2_M 

ORF5 

AIBV2 

AIBV 



BoCov 

OC43 

PHEV 

FCV 

TGEV 

TOR2_M 

ORF5 

AIBV2 

AIBV 



BoCov 

OC43 

PHEV 

FCV 

TGEV 

TOR2_M 

ORF5 

AIBV2 

AIBV 



BoCov 

OC43 

PHEV 

FCV 

TGEV 

TOR2_M 

ORF5 

AIBV2 

AIBV 



~~ ~ '~ MSSVTTPA P — VYTWTA0EAIKFLKEWNFSL 

ZZZ ~ MSSKTTPAP- -VYIWTADEAIKFLKEWNFSL 

~ MSSPTTPVP — VISWTADEAIKFLKEWNFSL 

~ MAD — NGTITVEELKQLLEQWNLVI 

ZZZ-Z1 ZZZ MAD — NGTITVEELKQLLEQWNLVI 

ZZZ-~ ~ _ " I MMEN CTLNLEQATLLFKEYNLFI 

MSNGTEN CTLSTQQAAELFKEYNLFI 

: : : : * : 

S??? 1 ^ ^^^^^^^^^^^^^^^M^LTIILTIFNOT — YALNN-VYLGLS 

c^™^ qygrpqf£ ^^ 

™ tr^f I ^ QYGRPQFSV ^ GI ^*^^^ 

GFLFLAWIMLLQFAYSNRNRFLYI IKL VPLVn^WPVTLACFVLAAV- - YRINW-VTGGI A 

GFLFLAWIMLLQFAYSNRNRFLYIIKLVFLWLLWPVTLACFVLAAV--YRINW-VTGGIA 

TAFLLFLTILLQYGYATRSRFIYILKMIVLWCFWPLNIAVGVISCI--YPPNT-GGLVAA 
TAFLLFLTILLQYGYATRSRFI YILKMIVLWCFWPIjNI AVGI I SCI — YPPNT-GGLVAA 
: • : :**:.. .*:*:*::.:*:**:. : : . * . . 

IVFTWAIIMWIVYFWSIRLFIRTC 

IVFTI VAI IMWIVYFVNS IRLFI RTGSFWSFNPETNNLMC IDMK- GTMYVRPI I EDYHTL 
I^TIVAIIMWVVYFWSIRLFIRTGSWWSFNPETNNI^CIDMK-GRMYVRPIIEDYHTL 
y^J^^^^kWMMYFVRS ^Q^YRRTKSWWSFNPETNAI LC VNAL - GRS YVLPLDGTPTGV 

^^^^^ 

^™^! FVGYWIQSCRLF ^ C ^ mSFNPESNAVGSILL ™^ 

IILTVPACLSFVGYWIQSFRLFKRCRSWWSFNPESNAVGSILLTNGQQCNFAIESVPMVL 
: : :: *:: * :* ; * * *****★.. . . * 

WTXIRGHL YMQGIKLGTG YSLSDLPAYVTVAKVSHLLTYKR- - -GFLDKIGDTSGFAVY 

TVTIIRGHLYIQGIKLGTGYSWADLPAYMTVAKVTHLCTYKR GFLDRISDTSGFAVY 

TATIIRGHLYIQGIKLGTGYSLSDLPAYVTVAKVTHLCTYKR GFLDRIGDTSGFAVY 

TLTLLSGNLYAEGFKMAGGLT I EHLPKYVMI RTPNRTI VYTLV — GKQLKATTATGWAYY 
TLTLLSGNL YAEGFKIAGGMNIDNLPK YVMVALPSRTI VYTLV- - GKKLKAS S ATGWAYY 
GAVI IRGHLRMAGHSLGR- CDIKDLPKEITVAT- SRTLSYYKL- -GASQRVGTDSGFAAY 
GAVI IRGHLRMAGHSLGR- CDIKDLPKEITVAT- SRTLSYYKL — GASQRVGTDSGFAAY 
3[ ^ ^^ GT ^^^ EG QWL AK ~ C E PDHLPKD I FVC T P DRRN I YRMVQ K YTGDQ SGNKKRVAT F 

SPIIKNGALYCEGQWI^-CEPDHLPKDIFVCTPDRRNIYRWQKYTGDQSGNKKRFATF 
: * * * : . ** . > . * 

VKS KVGNYRL P STQKG SGLDTALLRNNI 
VKSKVGNYRLPSTQKGSGMDTALLRNNI 
VKS KVGNYRL PSTHKGSGMDTALLRNNI 
VKSKAGDYSTEARTDNLSEHEKLLHMV- 
VKSKAGDYSTEARTDNLSEQEKLLHMV- 
NR YR I GNYKLNTD HAG SNDN I ALLVQ - - 

NRYRIGNYKLNTDHAGSNDNIALLVQ- - 

VYAKQSVDTGELESVPTGGSSLYT 

VYAKQ SVDTGELG SVATGG S SLYT 



Key 

PHEV 

BoCov 

AIBV 

TGEV 

FCV 

OC43 

AIBV2 



TOR2_M/ORF 5 



Name ^ ^ ^ 

Porcine heinagglutinating encephalomyelitis virus AAL80035 

matrix protein [Bovine coronavirus] . NP 150082 

membrane protein [Avian infectious bronchitis virus] . AAF35863 

?r^ ln CTransm f ssi ble gastroenteritis virus] . NP.058427 

membrane [feline coronavirus]. BAC01160 

membrane glycoprotein [Human coronavirus 0C43]. AAA45462 

rr^T^^JSS^ ^chitis virus] . AAK83027 



- ; ww^-.„„^ wa.wi.^iij.i.j. s virus j . AAKB3027 

sars associated coronavirus M glycoprotein (SEQ id NO: 34) 

Figure 9 
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(SEQ ID NO: 37) 
(SEQ ID NO: 38) 
(SEQ ID NO: 39) 
(SEQ ID NO: 40) 
(SEQ ID NO: 41) 
(SEQ ID NO: 42) 

(SEQ ID NO: 43) 
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BoCov 

OC43 

PHEV 

MHV 

AIBV2 

TCV 

AIBV 

FCV 

PTGV 

229E 

TOR2_N 



„f^°*°f T ? ~ SRASSGNRSGNGILK- - -WADQSDQSRNVQTRGRRVQS - -KOTATSOOP 
MSFVPGQENAGSRSSSVl^(^GILKKTTWADQTERGPmQ^ 

I.". ~ ~ MASGKAAGK- - -TDAPAPVIK LGGPKPP- -KVGSSGN- - 

I MASGKATGK TDAPAPIIK LGGPKPP- -KVGSSGN- - 

— MASGKAAGK TDAPAPVIK LGGPKPP — KVGSSGN-- 

~ ~" MATQGQRVN— WGDEPSKRR GRSNSR — GRKNNDI P - 

MANQGQRVS WGDESTKTR GRSNSR — GRKNNNIP- 

MATVK WADASEPQR G RQ GRIPYSL— 

MSDNGPQSNQRSAPRITFGGPTDSTDNNQNGGRNGARPKQRRPQGLPN 



BoCov 

OC43 

PHEV 

MHV 

AIBV2 

TCV 

AIBV 

FCV 

PTGV 

229E 

TOR2_N 



BoCov 

OC43 

PHEV 

MHV 

AIBV2 

TCV 

AIBV 

FCV 

PTGV 

229E 

TOR2_N 



BoCov 

OC43 

PHEV 

MHV 

AIBV2 

TCV 

AIBV 

FCV 

PTGV 

229E 

TOR2_N 



BoCov 

OC43 

PHEV 

MHV 

AIBV2 

TCV 

AIBV 

FCV 

PTGV 

229E 

TOR2 _N 



ssssssssssssssssssssssss^ 

AS wr-n c t xra K ^ Ij ^ ,I,P ^ PKFEGSG VPDNEN IKP S QQHG YWRR^AR- - FKPGKG 

WFQSIKAKKLNSPQPKFEGSGVPDNENIKTSQQHGYWRRQAR- - FKPQKC 
^^^^NAPAPKFEGSGVPDNmLKISQQHGYWRRQAR--YKPGKG 

FFNPITLQQGS™i^PRDFVPKGIGNR-DQQIGY^QTR--YRM^ 

SPLLVDS - EQPWKVIPRNLVPINKKDK-NKLIGYWNVQKR- - FRTRKG 
WFTALTQHG-KEELRFPRGQGVPINTNSGPDDQIGYYRRATRR-VRGGDG 



AS— 
AS — 
LS— 
LS— 
-Y— 
NTAS 



* * 



Mn^^f^^ GTCPI ^ QYG ™ IDG ^ ASN Q M VOTPADIVDRDPSSD^IPT 



KMKELSPRWFYYIXSTGPEASLPYGANKEGIVWATEGALNTPKDHIGT^P^^^ 

rfppgtvi,pqgy Y i E gs-grsapnsrstsrtssr A s S a---gsrsransg^---™g 

RFPPGTVLPQGYYIBGS-GRSAPNSRSTSRAPNRAPSA GSRSRANSGNR TSTPG 

rfapgtvlpqgfyvegs-grsapasrsgsrsqsrgp NN rars^S---qpI^ 

RFSM--GPDGNFR W DF-IPLKNRGRSG-RSTAASSAA---ASRAPSREGsL-4^D 

rfsdg--gpdsnprwdp-ip L h-rgrsg-rstaassaa---sspapsSr---g^r1g 

RFSDG--GPDGNFRWDF-IPLN-RGRSG-RSTAASSAA SSRAPSREGSR GRLNT 

KF^K-IPPQ FQLEVN R-SP^ S R S G S Q S RSV S Rms---4sSQS-- N S 

9 * 

VTPDMADQ1ASLVLAKLGKDAAKP QQVTKOTAKEIROK- TT 

VTPDMADQIASLVLAKLGKDATKP QQVTraTA^S 5" 

VTPDMADQIASLVLAKLGKDATKP-— Q^SS£"l£ 

VKPDMAEEIAALVLAKLGKDAGQP K^VTkSsaS^-IL 

SGDDIiIARAAKIIQDQQKKGS -MTkSZ S 

SEDDLIARAAKIIQDQQKKGS RziwSS^S^^ 

AEDDLIARAAKIIQDQQKKGS MTkSe^ «Y 

TIVAVLQKLGVTDK— QRSRSKS CE^SS^"™ 

AVLAALKKLGVYTEKQQQRSRSKS SsK Z™» 

LALLLLDRLNQLESKVSGKGQQCQG QTVTKKSAAEASKK--PR 

! : : 
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FDSTLSGFETIMKVIjNENL 
FDSTLSGFETIMKVLNENL 



NKPRQKRS PNKQCT VQQCFGKR GPNQNFGGGEMLKLGTSDPOFPIT.APT 

WK.FRgKRSPNKQCT— VQQCFGKR-— GPNQNFGGGEMLKLGTSDPOFPILAELAPTara 
NKPHQKRTP^QCP VQQCFGKR GPNQNFGGSEMLK^TSD^FPILAELAPTPSA 

r£ OTKomS ^^^^^^"K^KEGNFGDDKMNEEGIKDGRVTAMLNLVPSSHA 

CK- — rtvppgvs— idkvfgprt-kgkegnfgddkmneegikix;rvta»^vpsrha 

NKHTWKKTAGKGD VTNFYGAR SSSANFGDSDLVANGN^^CYPoSkECVPSVSS 

NKHTWKRTAGKGD VTRFYGTR — -SNSANFGDSDLVflNGSSMOTYPQLAECVPSVSS 

QK- RTATKQYN--VTQAFGRRGPEQTQGNFGDQDLIRQGTDYKHWPQIAQFAPSASA 

: : . :* * . ***. . * .::.*:: 

FFFGSRLELAKVQNLSGNLDEPQKDWELRYNGAIR 

FFFGSRLELAKVQNLSGNPDEPQKDVYELRYNGAIR 

PPres™^!™™ 1 ' 1 - FDSTLSGFETIMKVLNQNL 

FFFGSKLELVKKN— SGGADDPTKDVYELQYSGAIR FDSTLPGKPyTTMWTT mSktt 

CLFGSRVTPKLQL ^^HI»RFEFTTWKDDPQFDNYVKICDQCVDG^G^RPK^©EKCP 
CLFGSRVTPKLQP ^^^RFEFTTVVPRDDPQFDNYVTICDQCT^^IGTRPiSnEPRP 
CLFGSQVTPKIiQP ^^HLTFRFTTWSRDDPQFDNYVKICDEC^TOGVGTRPK^EVVRP 

ILFGSQWSAEEAG-DQVKVTI.THNYYLPKDDAKTS oft^t 

ILFGSYWTSKEDG--DQIEVTFTHKYHLPKDDPKTG nprn^T 

MLFDSHIVSKESG— NTWLTFTTRVTVPKDHPHLG "kptSSt 

FFGMSRIGMEVTP- - SGTWLTYHGAIKLDDKDPQFK DM V^LLNKHI 

• 

™ZS?S. EDGMMNISp KPQRQRG QKNGQVENDNVSVAAPKSRVQQNKSRELTAEDIS 

^eno22 GG ^*^'^^^^^^ ^Q^^^^^NVSVAKPKSSVQRNVSRELTPEDRS 
^BBS5DD»r^"^^^^ ^^^KKQDDEADKALTSDEERNNAQLEFYDE^-K 
KSR?«RDaTOrmo^^^ PK ^^KKPKKQDDEVDKALTSDEERNNAQLEFMEP-K 
DSYKRP^ ^EKKPKKQDDEVDKALTSDEERNNAQLEFDDEP-K 

SEVAKDQRQ RKSRSKSADKKPEELS— VTLEAYTDVFDDTOVE 

NAFTRE SEVA ^S RK ~ RKSRSKSAERSEQEWPDAL I ENYTDVFDDTQVE 

n»™™ZI MQQHP LLNPSALEFNPSQTSPATAEPVRDEVSIET-D 

DAYKTFPP ^ P ^^^KKTDEAQPLPQRQKKQPTVTLLPAADMDDFSRQLQNSMSG 

: * ... 

LLKKMDEP FTEDTSEI 

LLKKMDEP YTEDTSEI 

LLKKMDEP YTEDTSEI 

LLAQI LDDGWPDGLEDDSNV 

VINWGDAA LGENEL — 

VINWGDSA LGENHL — 

VINWGDSA LGENEL — 

MIDEVTN 

MIDEVTN 

IIDEVN 

ASADSTQA 



Key 
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BoCov 

AIBV 
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PTGV 

229E 
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TORJN 



NUCLEOCAPSID PROTEIN 

nucleocapsid protein [Bovine coronavirus] 
nucleocapsid protein [Avian infectious bronchitis virus] 
nucleocapsid [Feline coronavirus] . virus] . 

nucleoprotein [porcine transmissible gastroenteritis virus) 
nucleocapsid protein [Human coronavirus 229E1 J ' 

NUCLEOCAPSID PROTEIN. * 

nucleocapsid protein [porcine hemagglutinating encephalomyelitis] 
nucleocapsid protein [turkey coronavirus] . cepnaj.omyej.it is] 

SARS associated virus nucleocapsid protein (SEQ ID NO: 36) 



Genbank 
P18446 
NP_150083 
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AAM97563 
NP_073556 
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ATATTAC3GTTTTTACCTACCCAGGAAAAGCCAACCAACCTCGATCTCTTG 
TAGATCTGTTCTCTAAACGAACTTTAAAATCTGTGTAGCTGTCGCTCGGC 
TGCATGCCTAGTGCACCTACGCAGTATAAACAATAATAAATTTTACTGTC 
GTTGACAAGAAACGAGTAACTCGTCCCTCTTCTGCAGACTGCTTACGGTT 
TCGTCCGTGTTGCAGTCGATCATCAGCATACCTAGGTTTCGTCCGGGTGT 
GACCGAAAGGTAAGATGGAGAGCCTTGTTCTTGGTGTCAACGAGAAAAPA 

CACGTCCAACTCAGTTTGCCTGTCCTTCAGGTTAGAGACGTGCTAGTGCG 

tg«:ttc<^actctg^aagaggccctatcggaggScgtg^cIS 

TCAAAAATGGCACTTGTGGTCTAGTAGAGCTGGAAAAAGCKGTACTGCCC 
CAGCTTGAACAGCCCTATGTGTTCATTAAACGTTCTGATGCCTTAAGCAC 

CAATCACGGCCACAAGGTCGTTGAGCTGGTTGCAGAAATGGACGGCATTC 
AGTACCXSTCGTAGCGGTATAACACTG^ 

GAAACCCCAATTGCATACCGCAATGTTCTTCTTCGTAAGAACGGTAATAA 
GGGAGCCGGTGGTCATAGCTATGGCATCGATCTAAAGTCTTATGACTTAG 
GTGACGAGCTTGGCACTGATCCCATTGAAGATTATGAACAAAACTCGAAC 

ACTAAGCATGGCAGTGGTGCACTCCGTGAACTCACTCGTGAGCTCAATGG 
AGGTGCAGTCACTCGCTATGTCGACAACAATTTCTGTGGCCCAGATGGGT 
ACCCTCTTGATTGCATCAAAGATTTTCTCGCACGCGCGGGCAAGTCAATG 

CTGCTGCCGTGACCATGAGCATGAAATTGCCTGGTTCACTGAGCGCTCTG 
ATAAGAGCTACGAGCACCAGACACCCTTCGAAATTAAGAGTGCCAAGAAA 

tttgacactttcaaaggggaatgcccaaagtttgtgtttcctcttaaStt 

AAAAGTCAAAGTCATTCAACCACGTGTTGAAAAGAAAAAGACTGAGGGTT 

tcat^ggcgtatacgctctgtgtaccctgttgca^cIcagSSg? 

AACAATATGCACTTGTCTACCTTGATGAAATGTAATCATTGCGATGAACT 

ttcatggca<^cgtgcgactttctgaaagccacttgtgaacattotS?I 

CTCAAAATTTAGTTATTGAAG^ 

aatgctgtagtgaaaatgccatgtcctgcctgtcaagacccagagaotS 

ACCTGAGCATAGTGTTCCAGATTATCACAACCACTCAAACATTgStc 

gactccgcaagggaggtaggactagatgttttggaggctgtgtgtttgcc 
tatgttggctgctataataagcgtgcctactgggttcctcgtgctagtgc 

TGATATTGGCTCAGGCCATACTGGCATTACTGGTGACAATGTGGAGACCT 
TGAATGAGGATCTCCTTGAGATACTGAGTCGTGAACGTGTTAACATTAAC 

attgttggcgattttcatttgaatgaagaggttgccatcattttggcatc 

TTTCTCTGCTTCTACAAGTGCCTTTATTGACACTATAAAGAGTCTTGATT 
ACAAGTCTTTCAAAACCATTGTTGAGTCCTGCGGTAACTATAAAGTTACC 
AAGGGAAAGCCCGTAAAAGGTGCTTGGAACATTGGACAACAGAGATCAGT 
TTTAACACCACTGTGTGGTTTTCCCTCACAGGCTGCTGGTGTTATCAGAT 
CAATTTTTGCGCGCACACTTGATGCAGCAAACCACTCAATTCCTGATTTG 
CAAAGAGCAGCTGTCACCATACTTGATGGTATTTCTGAACAGTCATTACG 
TCTTCTCGACGCCAIX^TTTATACTTCAGACCTGCTCACCAAC^GTGTCA 
TTATTATGGCATATGTAACTGGTGGTCTTGTACAACAGACTTCTCAGTGG 
TTGTCTAATCTTTTGGGCACTACTGTTGAAAAACTCAGGCCTATCTTTGA 
ATGGATTGAGGCGAAACTTAGTGCAGGAGTTGAATTTCTCAAGGATGCTT 
™^ TTCTC ^ TTTCTCATTACAGGTGTTT ^ACATCGTCAAGGGT 
CAAATACAGGTTGCTTCAGATAACATCAAGGATTGTGTAAAATGCTTCAT 
TGATGTTGTTAACAAGGCACTCGAAATGTGCATTGATCAAGTCACTATCG 

ctggcgcaaagttgcgatcactcaacttaggtgaagtcttcatcgSc^ 

AGCAAGGGACTTTACCGTCAGTGTATACGTGGCAAGGAGCAGCTGCAACT 
ACTCATGCCTCTTAAGGCACCAAAAGAAGTAACCTTTCTTGAAGGTGATT 
CACATGACACAGTACTTACCTCTGAGGAGGTTGTTCTCAAGAACGGTGAA 
CTCGAAGCACTCGAGACGCCCGTTGATAGCTTCACAAATGGAGCTATCGT 
TGGCACACCAGTCTGTGTAAATGGCCTCATGCTCTTAGAGATTAAGGACA 
AAGAACAATACTGCGCATTGTCTCCTGGTTTACTGGCTACAAACAATGTC 

TOT^CTTAAAAGGGGGTCCaCC^lTAAAGGTGTAACCTTTCGAGAMA 
TACTGTTTGGGAAGTTCAAGGTTACAAGAATGTGAGAATCACATTTGAGC 
TTGATGAACGTGTTGACAAAGTGCTTAATGAAAAGTGCTCTGTCTACACT 
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GTTGAATCCGGTACCGAAGTTACTGAGTTTGCATGTGTTGTAGCAGAGrr 
TCTTGTGAAGACTTTACAACCAGTTTCTGATCTCCTTACCAACATGGGTA 

^gatcttgatgagtggagtgtac^tacattctacttatttoSga^S 

GGTGAAGAAAACTTTTCATCACGTATGTATTGTTCCTTTTACCCTCCAra 
TCAGGAAGAAGAGGACGATGCAGAGTGTGAGGAAGAAGAAATTGlTGAAt 

cctgtgaacatgagtacggtacagaggatgattatcaacmtctccctctS 



^!^^2?^^ CAGTT CGAGTTGAGGAAGAAGAAGAGGA 

AG 



agactggctggatgatactactgagcaatcagagattgagccagaaSag 
aacctacacctgaagaaccagttaatcagtttactggttattJaaaI? 



ttggacctaacctaaatgcaggtgaggacatccagcttcttaaSSSa 

TATGAAAATTTCAATTCACAGGACATCTTACTTGCACCATTGTTCTCAG? 
AGGCATATTTGGTGCTAAACCACTTCAGTCTTTACAAGTGTGCGTGCAGA 

cggttcgtacacaggtttatattgcagtcaatgacaaagctctttatgS 

CAGGTTGTCATGGATTATCTTGATAACCTGAAGCCTAGAGTGGAAGC^CC 
TAAACAAGAGGAGCCACCAAACACAGAAGATTCCAAAACTGAG^ 

GATGAGGTTACCACAACACTGGAAGAAACTAAGTTTCTTACCAATAAGTT 
ACTCTTGTTTGCTGATATCAATGGTAAGCTTTACCATGATTCTCAGAACA 



TGCTTAGAGGTGAAGATATGTCTTTCCTTGAGAAGGATGCACCTTACATG 
GTAGGTGATGTTATCACTAGTGGTGATATCACTTGTGTTGTAATACCCTC 
CAAAAAG ^ CTGGTGGCACTACTGA ^^^ 



^ C ^ TGATGAGTATATAACCACGTACC CTGGACAAGGATGTGC^T 

^tacacttgaggaagctaagactgctcttaagaaatgcaamctcS 

^ A ^ ACTACCTTCAGAAGCACCTAATGCT AAGGAAGAGATTC^ 

C ? G ™ CCTCGAATTTGAGAG ^ 

AAATTAATGCCTATATGCATGGATGTTAGAGCCATAATGGCAACCATCCA 

acgtaagtataaaggaattaaaattcaagagggcatcgttgactatggS 
tccgattcttcttttatactagtaaagagcctgtagcttctattatt?S 

AAGCTCAACTCTCTAAATGAGCCGCTTGTCACAATGCCAATTGGTTATC? 

gacacatggttttaatcttgaagaggctgcgcgctgtatgcgttctct?J 

AAGCTCCTGCCGTAGTGTCAGTATCATCACCAGATGCTGTTACTACAiA? 

aatggatacctcacttcgtcatcaaagacatctgaggagcactotgtagI 

aacagtttctttggctggctcttacagagattggtcctattcaggaSS 
gtacagagttaggtgttgaatttcttaagot 

cacactctggagagccccgtcgagtttcatcttgacggtgaggScStc 
acttgacaaactaaagagtctcttatccctgcgggaggttaagaSSS 
aagtgttcacaactgtggacaacactaatctccacac^ 

ATGTCTATGACATATGGACAGCAGTTTGGTCCAACATACTTGGATGGTGC 
S G ™ ACAAAAATTAAACCTCATGTAAA ^ATG^^ 

^^ ACTACCTAGTGATCACACACTACGTA ^gaagctttcgagtS 

catactcttgatgagagttttcttggtaggtacatgtctgctttaaacca 
cacaaagaaatggaaatttcctcaagttggtggtttaacttcaattaa^ 

GGGCTGATAACAATTCTTATTTGTCTAGTGTTTTATTAGCACTOCAA^G 

C ^ GC ^ TGATCCTGCTAACTTTTCTCCACT CATACTCGCTTACAGTA 
A ^^S TGTOGGCGAGCTTGGTCATGT CAGAGAAACTATGACC^ 

GTACATTCTTATGTGCGAATGAGTACACTGGTAACTATCAGTGTGGTCAT 
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TACACTCATATAACTGCTAAGGAGACCCTCTATCGTATTGACGGAGCTCA 
CCTTACAAAGATGTCAGAGTACAAAGGACCAGTGACTGATGTTTTCTACA 
AGGAAACATCTTACACTACAACCATCAAGCCTGTGTCGTATAAACTCGAT 
GGAGTTACTTACACAGAGATTGAACCAAAATTGGATGGGTATTATAAAAA 
GGATAATGCTTACTATACAGAGCAGCCTATAGACCTTGTACCAACTCAAC 
CATTACCAAATGCGAGTTTTGATAATTTCAAACTCACATGTTCTAACACA 
AAATTTGCTGATGATTTAAATCAAATGACAGGCTTCACAAAGCCAGCTTC 
ACGAGAGCTATCTGTCACATTCTTCCCAGACTTGAATGGCGATGTAGTGG 
CTATTGACTATAGACACTATTCAGCGAGTTTCAAGAAAGGTGCTAAATTA 
CTGCATAAGCCAATTGTTTGGCACATTAACCAGGCTACAACCAAGACAAC 
GTTCAAACCAAACACTTGGTGTTTACGTTGTCTTTGGAGTACAAAGCCAr 

TAGATACTTCAAATTCATTTGAAGTTCTGGCAGTAGAAGACACACAAGGA 

GGAAAATCCTACCATACAGAAGGAAGTCATAGAGTGTGACGTGAAAACTA 
CCGAAGTTGTAGGCAATGTCATACTTAAACCATCAGATGAAGGTGTTAAA 
GTAACACAAGAGTTAGGTCATGAGGATCTTATGGCTGCTTATGTGGAAAA 
CACAAGCATTACCATTAAGAAACCTAATGAGCTTTCACTAGCCTTAGGTT 
TAAAAACAATTGCCACTCATGGTATTGCTGCAATTAATAGTGTTCCTTGG 
AGTAAAATTTTGGCTTATGTCAAACCATTCTTAGGACAAGCAGCAATTAC 
AACATCAAATTGCGCTAAGAGATTAGCACAACGTGTGTTTAACAATTATA 

TGCCTTATGTGTTTACATTATTGTTCCAATTGTGTACTTTTACTAAAAGT 
ACCAATTCTAGAATTAGAGCTTCACTACCTACAACTATTGCTAAAAATAG 

TGTTAAGAGTGTTGCTAAATTATGTTTGGATGCCGGCATTAATTATGTGA 

AGTCACCCAAATTTTCTAAATTGTTCACAATCGCTATGTGGCTATTGTTG 

TTAAGTATTTGCTTAGGTTCTCTAATCTGTGTAACTGCTGCTTTTGGTGT 

ACTCTTATCTAATTTTGGTGCTCCTTCTTATTGTAATGGCGTTAGAGAAT 

TGTATCTTAATTCGTCTAACGTTACTACTATGGATTTCTGTGAAGGTTCT 

TTTCCTTGCAGCATTTGTTTAAGTGGATTAGACTCCCTTGATTCTTATCC 

AGCTCTTGAAACCATTCAGGTGACGATTTCATCGTACAAGCTAGACTTGA 

CAATTTTAGGTCTGGCCGCTGAGTGGGTTTTGGCATATATGTTGTTCACA 

AAATTCTTTTATTTATTAGGTCTTTCAGCTATAATGCAGGTGTTCTTTGG 

CTATTTTGCTAGTCATTTCATCAGCAATTCTTGGCTCATGTGGTTTATCA 

TTAGTATTGTACAAATGGCACCCGTTTCTGCAATGGTTAGGATGTACATC 

TTCTTTGCTTCTTTCTACTACATATGGAAGAGCTATGTTCATATCATGGA 

TGGTTGCACCTCTTCGACTTGCATGATGTGCTATAAGCGCAATCGTGCCA 

CACGCGTTGAGTGTACAACTATTGTTAATGGCATGAAGAGATCTTTCTAT 

GTCTATGCAAATGGAGGCCGTGGCTTCTGCAAGACTCACAATTGGAATTG 

TCTCAATTGTGACACATTTTGCACTGGTAGTACATTCATTAGTGATGAAG 

TTGCTCGTGATTTGTCACTCCAGTTTAAAAGACCAATCAACCCTACTGAC 

CAGTCATCGTATATTGTTGATAGTGTTGCTGTGAAAAATGGCGCGCTTCA 

CCTCTACTTTGACAAGGCTGGTCAAAAGACCTATGAGAGACATCCGCTCT 

CCCATTTTGTCAATTTAGACAATTTGAGAGCTAACAACACTAAAGGTTCA 

CTGCCTATTAATGTCATAGTTTTTGATGGCAAGTCCAAATGCGACGAGTC 

TGCTTCTAAGTCTGCTTCTGTGTACTACAGTCAGCTGATGTGCCAACCTA 

TTCTGTTGCTTGACCAAGCTCTTGTATCAGACGTTGGAGATAGTACTGAA 

GTTTCCGTTAAGATGTTTGATGCTTATGTCGACACCTTTTCAGCAACTTT 

TAGTGTTCCTATGGAAAAACTTAAGGCACTTGTTGCTACAGCTCACAGCG 

AGTTAGCAAAGGGTGTAGCTTTAGATGGTGTCCTTTCTACATTCGTGTCA 

GCTGCCCGACAAGGTGTTGTTGATACCGATGTTGACACAAAGGATGTTAT 

TGAATGTCTCAAACTTTCACATCACTCTGACTTAGAAGTGACAGGTGACA 

GTTGTAACAATTTCATGCTCACCTATAATAAGGTTGAAAACATGACGCCC 

AGAGATCTTGGCGCATGTATTGACTGTAATGCAAGGCATATCAATGCCCA 
AGTAGCAAAAAGTCACAATGTTTCACTCATCTGGAATGTAAAAGACTACA 

TCTCTTTATCTGAACAGCTGCGTAAACAAATTCGTAGTGCTGCCAAGAAG 
AACAACATACCTTTTAGACTAACTTGTGCTACAACTAGACAGGTTGTCAA 
TGTCATAACTACTAAAATCTCACTCAAGGGTGGTAAGATTGTTAGTACTT 
GTTTTAAACTTATGCTTAAGGCCACATTATTGTGCGTTCTTGCTGCATTG 
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GTTTGTTATATCGTTATGCCAGTACATACATTGTCAATCCATGATGGTTA 

cacaaatgaaatcattggttacaaagccattcagga^tgtSS?Stc 

ACATCATTTCTACTCATGATTGTTTTCCAAATAAAC^ 

GCATCMTTTAGCCAGCGTGGTGGTTCATACAAAAATGACAAAAGCTGCCC 

TGTAGTAGCTGCTATCATTACAAGAGAGATTGGTTTCATAGTGCCTGGCT 

TACCGGGTACTGTGCTGAGAGCAATCAATGGTGACTTCTTGCATTTTCTA 

CCTCGTGTTTTTAGTGCTGTTGGCAACATTTGCTACACACCTTCCAAACT 

CATTGAGTATAGTGATTTTGCTACCTCTGCTTGCGTTCTTGCTGCTGAGT 

GTACAATTTTTAAGGATGCTATGGGCAAACCTGTGCCATATTGTTATGAC 

actaatttgctagagggttctatttcttatagt^ 

TCGTTATGTGCTTATGGATGGTTCCATCATACAGTTTCCTAACACTTACC 
TGGAGGGTTCTGTTAGAGTAGTAACAACTTTTGATCCTGACTACTCTAGA 
CATGGTACATGCGAAAGGTCAGAAGTAGGTATTTG^TATCTACCAGTG 

GTGGTGTTGATGCGATGAATCTCATAGCTAACATCTTTACTCCTCTTCTG 

caacctgtgggtgctttagatgtgtctgcttcagtagtggcSgSa? 

tattgccatattggtgacttgtgctgcctactactttatgaaattcagac 

gtgtttttggtgagtacaaccatgttgttgctgctaatgcacttttgttt 

ttgatgtctttcactatactctgtctggtaccagcttacagctttctgcc 

gggagtctactcagtctoacttgtacttgacattctattSScStc 

atgtttcattcttggctcaccttcaatggtttgccatgttttctcctatt 

gtgcctttttggataacagcaatctatgtattctgtatttctctgaagca 

ctgccattggttctttaacaactatcttaggaaaagagtcatcS 

gagttacatttagtaccttcgaggaggctgctttgtgtacctttttgctc 

aacaaggaaatgtacctaaaattgcgtagcgagacactgttgccacttac 

acagtataacaggtatcttgctctatataacaagtacaagtatttcagtS 

gagccttagatactaco^atcgtc^c^gc^c^^SS 

aaggctctaaatgactttagcaactcaggtgctgatgttctctaccaac? 

accacagacatcaatcacttct(x:tgttctgcagagtggttotaggaaS 

gc^ctac^ctcttaatggattgtggttggatgac^cagtatactgtcc 
aagacatgtcatttgcacagcagaagacatgcttaatcctaactatgaag 

ATCTGCTCATTCGCAAATCCAACCATAGCTTTCTTGTTCAGGCTGGCAAT 

gttcaacttcgtgttattggccattctatgcaaaattgtctgcttagSct 

gcgtgtctttctgctatatgcatcatatggack:ttccaacaggaotacIJ 

aactgcacaggctgcaggtacagacacaaccataacattaaatc^tttcS 
catggctgtatgctgctgttatcaatggtgataggtggtoJaIS 

TTCACGA.CTACTTTGAATGACTTTAACCTTGTGGCAATGAAGrACAACTA 
TGAACCTTTGACACAAGATCATGTTGACATATTGGGACCTCTTTCTGCTC 
AAACAGGAATTGCCGTCTTAGATATGTGTGCTGCTTTGAAAGAGCTGCTG 
CAGAATGGTATGAATGGTCGTACTATCCTTGGTAGCACTATTTTAGAAGA 

tgagtttacaccatttgatgttgttagacaatgctctggtgtta?cttS 

AAGGTAAGTTC^GAAAATTGTTAAGGGCACTCATCATTGGATGCTTTTA 

actttcttcacatcactattgattcttgttcaaagtacacagtggtcIS 

GTTTTTCTTTGTTTACGAGAATGCTTTCTTGCCATTTACTCTTGGTATTA 
TGGCAATTGCTGCATGTGCTATGCTGCTTGTTAAGCATAAGCACGCATTC 
TTGTGCTTGTTTCTGTTACCTTCTCTTGCAACAGTTGCTTACTTTAATAT 
^^S^^^^^^^^^^^^^'^^^^^CGTATCATGACATGGCTTGAAT 
TGGCTGACACTAGCTTGTCTGGTTATAGGCTTAAGGATTGTGTTATGTAT 

gcttcagctttagttttgcttattctcatgacagctcgcactgttSSa 
tgatgctgctagacgtgtttggacactgatgaatgtcattacacttgto? 
acaaagtctactatggtaatgctttagatcaagctatttccatgtgggcc 
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TTAGTTATTTCTGTAACCTCTAACTATTCTGGTGTCGTTACGACTATCAT 
GTTTTTAGCTAGAGCTATAGTGTTTGTGTGTGTTGAGTATTACCCATTGT 
TATTTATTACTGGCAACACCTTACAGTGTATCATGCTTGTTTATTGTTTC 
TTAGGCTATTGTTGCTGCTGCTACTTTGGCCTTTTCTGTTTACTCAACCG 
TTACTTCAGGCTTACTCTTGGTGTTTATGACTACTTGGTCTCTACACAAG 

aatttaggtatatgaactcccaggggcttttgcctcc?aagactag5^^ 
gatcctttcaagcttaacattaagttgt^tattg^SSatc 

ctctggtactgctctcggttcttcaacaacttagagtagactcatcot 

aaattgtgggcacaatgtgtacaactccacaatgatattcttc??S 

agacacaactgaagctttcgagaagatcgtttctcttttgtctgttttr? 

tatccatgcagggtgctgtagacattaataggttgtgcgaggaaatgc?c 

gataaccgtgctactcttcaggctattgcttcagaatttagtoSSc 

atcatatgccgcttatgccactgcccaggaggcctatgagcagg?tc?S 

ctaatggtgatoctgaagtcgttctcaaaaagttaaagaaat^ 

gtggctaaatctgagtttgaccgtgatgctgccatgcaacgcaagttgga 

aaagatggcagatcaggctatgacccaaatgtacaaacag^gatcS 

aggacaagagggcaaaagtaactagtgctatgcaaacaatgcotSS 

atgcttaggaagcttgataatgatgcacttaacaacattatcaacaatgc 
gcgtgatggttgtgttccactcaacatcatac^ 

aactcatggttgttgtccctgattatggtacctaca^Sa^gS 
ggtaacacctttacatatgcatctgcactctgggaaatccagcaagS 

caccaaatttcgcttggcctcttattgttacagctctaagSaIct?! 

gctgttaaactacagaataatgaactgagtccagtagcactacSa^agS 

gtcctgt^ggctggtaccacacaaacagcttgtactgatgaS 

^gcctactataacaattcgaagggaggtaggtttgtgcJggca^Sa 

tcagaccaccaagatctcaaatgggctagattccctaagagtgatggtIc 

AGGTACAATTTACACAGAACTGGAACCACCTTGTAGGTTTGTTACAGACA 

caccaaaagggcctaaagtgaaatacttgtacttcatcaaaggcttaaac 

AACCTAAATAGAGGTATGGTGCTGGGCAGTTTAGCTGCTACAGTACGTCT 

TCTGTGCTTTTGCAGTAGACCCTGCTAAAGCATATAAGGATTACCTAG^ 
AGTGGAGGACAACCAATCACCAACTGTGTGAAGATGTTGTGTACA^ 

tggtacaggacaggcaattactgtaacaccagaagctaacatggaccaaS 
agtcctttggtggtgcttcatgttgtctgtattgtagatccS^aSSgI? 
catccaaatcctaaaggattctgtgacttgaaaggtaagtacgtccaaIt 

acctaccacttgtgctaatgacccagtgggttttacacttagaaaca^g 
^S^ ccgtctgcgg ^gtggaaagg T tatggctg T agtt^^ 

ctccgcgaacccttgatgcagtctgcggatgcatcaacgtttttaaacgg 
gtttgcggtgtaagtgcagcccgtcttacaccgtgcggcacaggc1?5S 
tactcatgtcgtctacagggctttt^tatttacaaSaaa^ 

gttttgcaaagttcctaaaaactaattgctgtcgcttccaggagaaSgS 
gaggaaggcaatttattagactcttactttgtagSaagag^^ 

^^^^^^^^^^^^^■^^^^actatttataacttggttaaagattgtc 
cagcggttgctgtccatgactttttcaagtttagagtaga^tgacatg 
gtaccacatatatcacgtcagcgtctaactaaatacacaaSS?? 

AAATACTCGTCACATACAATTGCTGTGATGATGATTATTTCAATAAGAAG 

GATTGGTATGACTTCGTAGAGAATCCTGACATCTTACGCGTATATGCTAA 
CTTAGGTGAGCGTGTACGCCAATCATTATT^ 

atgctatgcgtgatgcaggcatigtaggcgtactgacatta^SSS 

GATCTTAATGGGAACTGGTACGATTTCGGTGATTTCGTACAAGTAG^ 
AGGCTGCGGAGTTCCTATTGTGGATTCATATTACTCATTGCTGATGCcS 

tcctcactttgactagggcattggctgctgagtcccatai^atgctSS 

CTCGCAAAACCACTTATTAAGTGGGATTTGCTGAAATATGATTOTACGGA 

agagagactttgtctcttcgaccgttattttaaatattgggIS 
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ATCCCAATTGTATTAACTGTTTGGATGATAGGTGTATCCTTCATTGT 

gcaaactttaatgtcttattttctactgtgtttccacctacSgt?SS 
accactagtaagaaaaatatttgtagatggtgttccttttcttcSS? 
ctggataccattttcgtgagttaggagtcgtacataatcaSatc?^ 
ttacatagctcgcgtctcagtttcaaggaacttttSta^Sgc^ 

TCCAGCTATGCATGCAGCTTCTGGCAATTTATTGCTAGATAAACGCACTA 
CATGCTTTTCAGTAGCTGCACTAACAAACAATGTOGCTTTTCAAACTGTC 

aaacccggtaattttaataaagacttttatgactttgctgtgtctaSgS 

TTTCTTTAAGGAAGGAAGTTCTGTTGAACTAAAACACTTCTTCTTTGCTC 

aggatggcaacgctgctatcagtgattatgactattatcgttataatctS 
ccaacaatgtgtgatatcagacaactcctattcgtagttgaagttgtoS 

TAAATACTTTGATTGTTACGATGGTGGCTGTATTAATGCCAACCAAGTAA 
TCGTTAACAATCTGGATAAATCAGCTGGTTTCCCATTTAATAAATCGGCT 

^^ TAGAC ™ otat ^ ctc ^ tc agttatgaggatc^tcS 

toto a^^^^^^^^^^^^^^^^^^^^^^^^^A^^A^AACTCAAATGA^TC 
TTAAGTATGCCATTAGTGCAAAGAATAGAGCTCGCACCGTAGCTGGTGTC 

tctatctgtagtactatc^caaatagac^gtttcatcaSSa?^ 
gtcaatagccgccactagaggagctactgtggtaattggaacaagcSgt 
tttacggtggctggcataatatgttaaaaactgtttacagtgatgtJSaI 
actccacaccttatgggttgggattatccaaaatgtgacagagccatgcc 

TAACATGCTTAGGATAATGGCCTCTCTTGTTCTTGCTCGCAAACATAACA 

CAAGTATTAAGTGAGATGGTCATGTGTGGCGGCTCACTATATGTTAAACC 
AGGTGGAACATCATCCGGTGATGCTACAACTGCTTATGCTAATAGTGTCT 
TTAACATTTGTCAAGCTGTTACAGCCAATGTAAATGCACTTCTTTCAACT 
?„^^^^ AA ^ AA ^A^^^^A < ^A^A^QTCCGCAATCTACAACACAGGCT 
CTATGAGTGTCTCTATAGAAATAGGGATGTTGATCATGAATTCGTGGATG 
AGTTTTACGCTTACCTGCGTAAACATTTCTCCATGATGATTCTTTCTGAT 
GATGCCGTTGTGTGCTATAACAGTAACTATGCGGCTCAAGGTTTAGTAGC 
TAGCATTAAGAACTTTAAGGCAGTTCTTTATTATCAAAATAATGTGTTCA 
TGTCTGAGGCAAAATGTTGGACTGAGACTGACCTTACTAAAGGACCTCAC 
GAATTTTGCTCACAGCATACAATCCTAGTTAAACAAGGAGATGATTACGT 

gtacctgccttacccagatccatcaagaatattaggcgcaggctgttStg 

TCGATGATATTGTCAAAACAGATGGTACACTTATGATTGAAAGGTTCGTC 
^S^^^^^^^^^^^^^^^^^ACTTACAAAACATCCTAATCAG^AGTA 

tcctgatgtctttcacttgtatttacaatacattagaaagttacatgatS 

AGC ™ AC ^ GCCACATC ?^ 

AACACCTCACGGTACTGGGAACCTGAGTTTTATGAGGCTATGTACACACr 
ACATACAGTCTTGCAGGCTGTAGGTGCTTGTGTATTGTGCAATTCACAGA 

cttcacttcgttgcggtgcctgtattaggagaccattcctatgtScaag 
tgctoctatgaccatgtcatttcaacatcac^caaattagtgtSctc? 

TAATCCCTATGTTTGCAATGCCCCAGGTTGTGATGTCACTGATGTGACAC 
AACTGTATCTAGGAGGTATGAGCTATTATTGCAAGTCACATAAGCCTCCP 
ATTAGTTTTCCATTATGTGCTAATGGTCAGGTTTTTGGTTTATACAAAAA 
CACATGTGTAGGCAGTGACAATGTCACTGACTTCAATGCGATAGCAACAT 
GTGATTGGACTAATGCTGGCGATTACATACTTGCCAACACTTGTACTGAG 
AGACTCAAGCTTTTCGCAGCAGAAACGCTCAAAGCCACTGAGGAAACATT 
TAAGCTGTCATATGGTATTGCCACTGTACGCGAAGTACTCTCTGACAGAG 
AATTGCATCTTTCATGGGAGGTTGGAAAACCTAGACCACCATTGAACAGA 
AACTATGTCTTTACTGGTOACCGTGTAACTAAAAATAGTAAAGTA^S 

tggagagtacacctttgaaaaaggtgactatcgtgatgctgttgtctSa 

GAGGTACTACGACATACAAGTTGAATGTTGGTGATTACTTTGTGTTGACA 
TCTCACACTGTAATGCCACTTAGTGCACCTACTCTAGTGCCACAAGAGCA 

ctatgtgagaattactggcttgtacccaacactcaacatctSgISJ 

TTTCTAGCAATGTTGCAAATTATCAAAAGGTCGGCATGCAAAAGTACTCT 
ACACTCCAAGGACCACCTGGTACTGGTAAGAGTCATTTTGCCATCGGACT 
TGCTCTCTATTACCCATCTCCTCGCATAGTGTATACGGCMGCTCTC^ 
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CAGCTGTTGATGCCCTATGTGAAAAGGCATTAAAATATTTGCCCATAGAT 
AAATGTAGTAGAATCATACCTGCGCGTGCGCGCGTAGAGTGTTTTGATAA 
ATTCAAAGTGAATTCAACACTAGAACAGTATGTTTTCTGCACTGTAAATG 
CATTGCCAGAAACAACTGCTGACATTGTAGTCTTTGATGAAATCTCTATR 
GCTACTAATTATGACTTGAGTGTTGTCAATGCTAGACTTCGTGCAAAACA 
CTACGTCTATATTGGCGATCCTGCTCAATTACCAGCCCCCCGCACATTGC 
TGACTAAAGGCACACTAGAACCAGAATATTTTAATTCAGTGTGCAGACTT 
ATGAAAACAATAGGTCCAGACATGTTCCTTGGAACTTGTCGCCGTTGTCC 
TGCTGAAATTGTTGACACTGTGAGTGCTTTAGTTTATGACAATAAGCTAA 
AAGCACACAAGGATAAGTCAGCTCAATGCTTCAAAATGTTCTACAAAGGT 

GTTATTACACATGATGTTTCATCTGCAATCAACAGACCTCAAATAGGCGT 
TGTAAGAGAATTTCTTACACGCAATCCTGCTTGGAGAAAAGCTGTTTTTA 
TCTCACCTTATAATTCACAGAACGCTGTAGCTTCAAAAATCTTAGGATTG 
CCTACGCAGACTGTTGATTCATCACAGGGTTCTGAATATGACTATGTCAT 
ATTCACACAAACTACTGAAACAGCACACTCTTGTAATGTCAACCGCTTCA 
ATGTGGCTATCACAAGGGCAAAAATTGGCATTTTGTGCATAATGTCTGAT 

AGAGATCTTTATGACAAACTGCAATTTACAAGTCTAGAAATACCACGTCG 
CAATGTGGCTACATTACAAGCAGAAAATGTAACTGGACTTTTTAAGGACT 
GTAGTAAGATCATTACTGGTCTTCATCCTACACAGGCACCTACACACCTC 
AGCGTTGATATAAAGTTCAAGACTGAAGGATTATGTGTTGACATACCAGG 
CATACCAAAGGACATGACCTACCGTAGACTCATCTCTATGATGGGTTTCA 
AAATGAATTACCAAGTCAATGGTTACCCTAATATGTTTATCACCCGCGAA 
GAAGCTATTCGTCACGTTCGTGCGTGGATTGGCTTTGATGTAGAGGGCTG 
TCATGCAACTAGAGATGCTGTGGGTACTAACCTACCTCTCCAGCTAGGAT 

tttctacaggtgttaacttagtagctgtaccgactggttatgttgaSact 

GAAAATAACACAGAATTCACCAGAGTTAATGCAAAACCTCCACCAGGTGA 
CCAGTTTAAACATCTTATACCACTCATGTATAAAGGCTTGCCCTGGAATG 
TAGTGCGTATTAAGATAGTACAAATGCTCAGTGATACACTGAAAGGATTG 
TCAGACAGAGTCGTGTTCGTCCTTTGGGCGCATGGCTTTGAGCTTACATC 
AATGAAGTACTTTGTCAAGATTGGACCTGAAAGAACGTGTTGTCTGTGTG 
ACAAACGTGCAACTTGCTTTTCTACTTCATCAGATACTTATGCCTGCTGG 
AATCATTCTGTGGGTTTTGACTATGTCTATAACCCATTTATGATTGATGT 
TCAGCAGTGGGGCTTTACGGGTAACCTTCAGAGTAACCATGACCAACATT 
GCCAGGTACATGGAAATGCACATGTGGCTAGTTGTGATGCTATCATGACT 
AGATGTTTAGCAGTCCATGAGTGCTTTGTTAAGCGCGTTGATTGGTCTGT 
TGAATACCCTATTATAGGAGATGAACTGAGGGTTAATTCTGCTTGCAGAA 
AAGTACAACACATGGTTGTGAAGTCTGCATTGCTTGCTGATAAGTTTCCA 

GTTCTTCATGACATTGGAAATCCAAAGGCTATCAAGTGTGTGCCTCAGGC 
TGAAGTAGAATGGAAGTTCTACGATGCTCAGCCATGTAGTGACAAAGCTT 
^^^^^^TCT^TATTCTTATGCTACACATCACGATAAATTC 
ACTGATGGTGTTTGTTTGTTTTGGAATTGTAACGTTGATCGTTACCCAGC 
CAATGCAATTGTGTGTAGGTTTGACACAAGAGTCTTGTCAAACTTGAACT 
TACCAGGCTGTGATGGTGGTAGTTTGTATGTGAATAAGCATGCATTCCAC 
ACTCCAGCTTTCGATAAAAGTGCATTTACTAATTTAAAGCAATTGCCTTT 
CTTTTACTATTCTGATAGTCCTTGTGAGTCTCATGGCAAACAAGTAGTGT 
CGGATATTGATTATGTTCCACTCAAATCTGCTACGTGTATTACACGATGC 
™J^ TGGTGCTGTTTCCAGACACCA TOCAAATGAGTACCGACAGTA 
CTTGGATGCATATAATATGATGATTTCTGCTGGATTTAGCCTATGGATTT 
ACAAACAATTTGATACTTATAACCTGTGGAATACATTTACCAGGTTACAG 
AGTTTAGAAAATGTGGCTTATAATGTTGTTAATAAAGGACACTTTGATGG 
ACACGCCGGCGAAGCACCTGTTTCCATCATTAATAATGCTGTTTACACAA 
AGGTAGATGGTATTGATGTGGAGATCTTTGAAAATAAGACAACACTTCCT 
GTTAATGTTGCATTTGAGCTTTGGGCTAAGCGTAACATTAAACCAGTGCC 
AGAGATTAAGATACTCAATAATTTGGGTGTTGATATCGCTGCTAATACTG 
TAATCTGGGACTACAAAAGAGAAGCCCCAGCACATGTATCTACAATAGG? 

gtctgcacaatgactc^catt^caagaaacctactgagagtgS??? 

TTCACTTACTGTCTTGTTTGATGGTAGAGTGGAAGGACAGGTAGACCTTT 
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TTAGAAACGCCCGTAATGGTGTTTTAATAACAGAAGGTTCAGTCAAAGGT 

CTAACACCTTCAAAGGGACCAGCACAAGCTAGCGTCAATGGAGTCACATT 

AATTGGAGAATCAGTAAAAACACAGTTTAACTACTTTAAGAAAGTAGACG 

GCATTATTCAACAGTTGCCTGAAACCTACTTTACTCAGAGCAGAGACTTA 

GAGGATTTTAAGCCCAGATCACAAATGGAAACTGACTTTCTCGAGCTCGC 

TATGGATGAATTCATACAGCGATATAAGCTCGAGGGCTATGCCTTCGAAC 

ACATCGTTTATGGAGATTTCAGTCATGGACAACTTGGCGGTCTTCATTTA 

ATGATAGGCTTAGCCAAGCGCTCACAAGATTCACCACTTAAATTAGAGGA 

TTTTATCCCTATGGACAGCACAGTGAAAAATTACTTCATAACAGATGCGC 

AAACAGGTTCATCAAAATGTGTGTGTTCTGTGATTGATCTTTTACTTGAT 

GACTTTGTCGAGATAATAAAGTCACAAGATTTGTCAGTGATTTCAAAAGT 

GGTCAAGGTTACAATTGACTATGCTGAAATTTCATTCATGCTTTGGTGTA 

AGGATGGACATGTTGAAACCTTCTACCCAAAACTACAAGCAAGTCAAGCG 

TGGCAACCAGGTGTTGCGATGCCTAACTTGTACAAGATGCAAAGAATGCT 

TCTTGAAAAGTGTGACCTTCAGAATTATGGTGAAAATGCTGTTATACCAA 

AAGGAATAATGATGAATGTCGCAAAGTATACTCAACTGTGTCAATACTTA 

AATACACTTACTTTAGCTGTACCCTACAACATGAGAGTTATTCACTTTGG 

TGCTGGCTCTGATAAAGGAGTTGCACCAGGTACAGCTGTGCTCAGACAAT 

GGTTGCCAACTGGCACACTACTTGTCGATTCAGATCTTAATGACTTCGTC 

TCCGACGCAGATTCTACTTTAATTGGAGACTGTGCAACAGTACATACGGC 

TAATAAATGGGACCTTATTATTAGCGATATGTATGACCCTAGGACCAAAC 

ATGTGACAAAAGAGAATGACTCTAAAGAAGGGTTTTTCACTTATCTGTGT 

GGATTTATAAAGCAAAAACTAGCCCTGGGTGGTTCTATAGCTGTAAAGAT 

AACAGAGCATTCTTGGAATGCTGACCTTTACAAGCTTATGGGCCATTTCT 

CATGGTGGACAGCTTTTGTTACAAATGTAAATGCATCATCATCGGAAGCA 

TTTTTAATTGGGGCTAACTATCTTGGCAAGCCGAAGGAACAAATTGATGG 

CTATACCATGCATGCTAACTACATTTTCTGGAGGAACACAAATCCTATCC 

AGTTGTCTTCCTATTCACTCTTTGACATGAGCAAATTTCCTCTTAAATTA 

AGAGGAACTGCTGTAATGTCTCTTAAGGAGAATCAAATCAATGATATGAT 

TTATTCTCTTCTGGAAAAAGGTAGGCTTATCATTAGAGAAAACAACAGAG 

TTGTGGTTTCAAGTGATATTCTTGTTAACAACTAAACGAACATGTTTATT 

TTCTTATTATTTCTTACTCTCACTAGTGGTAGTGACCTTGACCGGTGCAC 

CACTTTTGATGATGTTCAAGCTCCTAATTACACTCAACATACTTCATCTA 

TGAGGGGGGTTTACTATCCTGATGAAATTTTTAGATCAGACACTCTTTAT 

TTAACTCAGGATTTATTTCTTCCATTTTATTCTAATGTTACAGGGTTTCA 

TACTATTAATCATACGTTTGGCAACCCTGTCATACCTTTTAAGGATGGTA 

TTTATTTTGCTGCCACAGAGAAATCAAATGTTGTCCGTGGTTGGGTTTTT 

GGTTCTACCATGAACAACAAGTCACAGTCGGTGATTATTATTAACAATTC 

TACTAATGTTGTTATACGAGCATGTAACTTTGAATTGTGTGACAACCCTT 

TCTTTGCTGTTTCTAAACCCATGGGTACACAGACACATACTATGATATTC 

GATAATGCATTTAATTGCACTTTCGAGTACATATCTGATGCCTTTTCGCT 

TGATGTTTCAGAAAAGTCAGGTAATTTTAAACACTTACGAGAGTTTGTGT 

TTAAAAATAAAGATGGGrTTCTCTATGTTTATAAGGGCTATCAACCTATA 

GATGTAGTTCGTGATCTACCTTCTGGTTTTAACACTTTGAAACCTATTTT 

TAAGTTGCCTCTTGGTATTAACATTACAAATTTTAGAGCCATTCTTACAG 

CCTTTTCACCTGCTCAAGACATTTGGGGCACGTCAGCTGCAGCCTATTTT 

GTTGGCTATTTAAAGCCAACTACATTTATGCTCAAGTATGATGAAAATGG 

TACAATCACAGATGCTGTTGATTGTTCTCAAAATCCACTTGCTGAACTCA 

AATGCTCTGTTAAGAGCTTTGAGATTGACAAAGGAATTTACCAGACCTCT 

AATTTCAGGGTTGTTCCCTCAGGAGATGTTGTGAGATTCCCTAATATTAC 

AAACTTGTGTCCTTTTGGAGAGGTTTTTAATGCTACTAAATTCCCTTCTG 

TCTATGCATGGGAGAGAAAAAAAATTTCTAATTGTGTTGCTGATTACTCT 

GTGCTCTACAACTCAACATTTTTTTCAACCTTTAAGTGCTATGGCGTTTC 

TGCCACTAAGTTGAATGATCTTTGCTTCTCCAATGTCTATGCAGATTCTT 

TTGTAGTCAAGGGAGATGATGTAAGACAAATAGCGCCAGGACAAACTGGT 

GTTATTGCTGATTATAATTATAAATTGCCAGATGATTTCATGGGTTGTGT 

CCTTGCTTGGAATACTAGGAACATTGATGCTACTTCAACTGGTAATTATA 
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ATTATAAATATAGGTATCTTAGACATGGCAAGCTTAGGCCCTTTGAGAGA 

gacatatctaatgtgcctttctcccctgatggcaaSSgcIScScc 
tgctcttaattgttattggccattaaatgattatggtttttacaccacta 
ctggcattggctaccaaccttacagagttgtagtactttcttttgaaott 
ttaaatgcaccggccacggtttgtggaccaaaattatccactgacctSS 
taagaaccagtgtgtcaattttaattttaatggactcactggtactggtg 
tgttaactccttctocaaagagatttcaaccatttcaacaatSggccgt 
gatgtttctgatttcactgattccgttcgagatcctaaaacatctgaaat 
attagacatttcaccttgcgcttttgggggtgtaagtgtaattacac^g 
gaacaaatgcttcatctgaagttgctgttctatat^gatgttaacS? 

ACTGATGTTTCTACAGCAATTCATGCAGATCAACTCACACCAGCTTGGCr 
CATATATTCTACTGGAAACAATGTATTCCAGACTCAAGCAGGCTGTCTTA 
^^??^^^^^^^^^^^^^ A ^^^^'^ I A'I I< 3AGTGCGACATTCCTATTGGA 
GCTCGCATTTGTGCTAGTTACCATACAGTTTCTTTATTACGTAGTACTAG 
CCAAAAATCTATTGTGGCTTATACTATGTCTTTAGGTGCTGATAGTTCAA 

^gcttactctaataacaccattgctatacctactaacttttcaaSagc 

ATTACTACAGAAGTAATGCCTGTTTCTATGGCTAAAACCTCCGTAGATTG 
TAATATGTACATCTGCGGAGATTCTACTGAATGTGCTAATTTGCTTCTCP 
AATATGGTAGCTTTTGCACACAACTAAATCGTGCACTCTCAGGTATTGCT 
GCTGAACAGGATCGCAACACACGTGAAGTGTTCGCTCAAGTCAAACAAAT 
GTACAAAACCCCAACTTTGAAATATTTTGGTGGTTTTAATTTTTCACAAA 

TATTACCTGACCCTCTAAAGCCAACTAAGAGGTCTTTTATTGAGGACTTG 
CTCTTTAATAAGGTGACACTCGCTGATGCTGGCTTCATGAAGCAATATGC 
CGAATGCCTAGGTGATATTAATGCTAGAGATCTCATTTGTGCGCAGAAGT 
TCAATGGACTTACAGTGTTGCCACCTCTGCTCACTGATGATATGATTGCT 
GCCTACACTGCTGCTCTAGTTAGTGGTACTGCCACTGCTGGATGGACATT 
TCGTGCTGGCGCTGCTCTTCAAATACCTTTTGCTATGCAAATGGCATATA 
GGTTCAATGGCATTGGAGTTACCCAAAATGTTCTCTATGAGAACCAAAAA 
CAAATCGCCAACCAATTTAACAAGGCGATTAGTCAAATTCAAGAATCACT 
TACAACAACATCAACTGCATTGGGCAAGCTGCAAGACGTTGTTAACCAGA 
ATGCTCAAGCATTAAACACACTTGTTAAACAACTTAGCTCTAATTTTGGT 
GCAATTTCAAGTGTGCTAAATGATATCCTTTCGCGACTTGATAAAGTCGA 
GGCGGAGGTACAAATTGACAGGTTAATTACAGGCAGACTTCAAAGCCTTC 
AAACCTATGTAACACAACAACTAATCAGGGCTGCTGAAATCAGGGCTTCT 
GCTAATCTTGCTGCTACTAAAATGTCTGAGTGTGTTCTTGGACAATCAAA 
AAGAGTTGACTTTTGTGGAAAGGGCTACCACCTTATGTCCTTCCCACAAG 
CAGCCCCGCATGGTGTTCTCTTCCTACATGTCACGTATGTGCCATCCclS 
GAGAGGAACTTCACCACAGCGCCAGCAATTTGTCATGAAGGCAAAGCATA 
CTTCCCTCGTGAAGGTGTTTTTGTGTTTAATGGCACTTCTTGGTTTATTA 

CACAGAGGAACTTCTTTTCTCCACAAATAATTACTACAGACAATACATTT 
GTCTCAGGAAATTGTGATGTCGTTATTGGCATCATTAACAACACAGTTTA 
TGATCCTCTGCAACCTGAGCTTGACTCATTCAAAGAAGAGCTGGACAAGT 
ACTTCAAAAATCATACATCACCAGATGTTGATCTTGGCGACATTTCAGGC 
ATTAACGCTTCTGTCGTCAACATTCAAAAAGAAATTGACCGCCTCAATGA 
GGTCGCTAAAAATTTAAATGAATCACTCATTGACCTTCAAGAATTGGGAA 
AATATGAGCAATATATTAAATGGCCTTGGTATGTTTGGCTCGGCTTCATT 
GCTGGACTAATTGCCATCGTCATGGTTACAATCTTGCTTTGTTGCATGAC 
TAGTTGTTGCAGTTGCCTCAAGGGTGCATGCTCTTGTGGTTCTTGCTGCA 
AGTTTGATGAGGATGACTCTGAGCCAGTTCTCAAGGGTGTCAAATTACAT 
TACACATAAACGAACTTATGGATTTGTTTATGAGATTTTTTACTCTTAGA 

TCAATTACTGCACAGCCAGTAAAAATTGACAATGCTTCTCCTGCAAGTAC 

tgttcatgctacagcaacgataccgctacaagcctcactccctttcSgat 

GGCTTGTTATTGGCGTTGCATTTCTTGCTGTTTTTCAGAGCGCTACCAAA 
ATAATTGCGCTCAATAAAAGATGGCAGCTAGCCCTTTATAAGGGCTTCCA 
GTTCATTTGCAATTTACTGCTGCTATTTGTTACCATCTATTCACATCTTT 
TGCTTGTCGCTGCAGGTATGGAGGCGCAATTTTTGTACCTCTATGCCTTG 

ATATATTTTCTACAATGCATCAACGCATGTAGAATTATTATGAGATGTTG 
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GCTTTGTTGGAAGTGCAAATCCAAGAACCCATTACTTTATGATGCCAACT 
ACTTTGTTTGCTGGCACACACATAACTATGACTACTGTATACCATATAAC 
AGTGTCACAGATACAATTGTCGTTACTGAAGGTGACGGCATTTCAACACC 
AAAACTCAAAGAAGACTACCAAATTGGTGGTTATTCTGAGGATAGGCACT 
CAGGTGTTAAAGACTATGTCGTTGTACATGGCTATTTCACCGAAGTTTAC 
m^S^^^^^^^^^^^^^^^^^^^^^^^^^^QGTATTGAAAATGC 
TACATTCTTCATCTTTAACAAGCTTGTTAAAGACCCACCGAATGTGCAAA 
TACACACAATCGACGGCTCTTCAGGAGTTGCTAATCCAGCAATGGATCCA 
ATTTATGATGAGCCGACGACGACTACTAGCGTGCCTTTGTAAGCACAAGA 
AAGTGAGTACGAACTTATGTACTCATTCGTTTCGGAAGAAACAGGTACGT 
TAATAGTTAATAGCGTACTTCTTTTTCTTGCTTTCGTGGTATTCTTGCTA 

GTCACACTAGCCATCCTTACTGCGCTTCGATTCTGTGCGTACTGCTGCAA 
TATTGTTAACGTGAGTTTAGTAAAACCAACGGTTTACGTCTACTCGCGTG 
TTAAAAATCTGAACTCTTCTGAAGGAGTTCCTGATCTTCTGGTCTAAACG 

GCAGACAACGGTACTATTACCGTTGAGGAGCTTAAACAACTCCTGGAACA 
ATGGAACCTAGTAATAGGTTTCCTATTCCTAGCCTGGATTATGTTACTAC 
AATTTGCCTATTCTAATCGGAACAGGTTTTTGTACATAATAAAGCTTGTT 
TTCCTCTGGCTCTTGTGGCCAGTAACACTTGCTTGTTTTGTGCTTGCTGr 

GTATTGTAGGCTTGATGTGGCTTAGCTACTTCGTTGCTTCCTTCAGGPTP 
TTTGCTCGTACCCGCTCAATGTGGTCATTCAACCCAGAAACAAACATTC? 
TCTCAATGTGCCTCTCCGGGGGACAATTGTGACCAGACCGCTCATGGAAA 
GTCAACTTGTCATTGGTGCTGTGATCATTCGTGGTCACTTGCGAATGGCC 
GGACACTCCCTAGGGCGCTGTGACATTAAGGACCTGCCAAAAGAGATCAC 
TGTGGCTACATCACGAACGCTTTCTTATTACAAATTAGGAGCGTCGCAGC 
aa^^m^^^^^'^^^^^^^^^^^^^^^^^^^^'^^'CQTATTG^A 
AACTATAAATTAAATACAGACCACGCCGGTAGCAACGACAATATTGCTTT 
GCTAGTACAGTAAGTGACAACAGATGTTTCATCTTGTTGACTTCCAGGTT 
ACAATAGCAGAGATATTGATTATCATTATGAGGACTTTCAGGATTGCTAT 
TTGGAATCTTGACGTTATAATAAGTTCAATAGTGAGACAATTATTTAAGC 
CTCTAACTAAGAAGAATTATTCGGAGTTAGATGATGAAGAACCTATGGAG 
TTAGATTATCCATAAAACGAACATGAAAATTATTCTCTTCCTGACATTGA 

TTGTATTTACATCTTGCGAGCTATATCACTATCAGGAGTGTGTTAGAGGT 
ACGACTGTACTACTAAAAGAACCTTGCCCATCAGGAACATACGAGGGCAA 
TTCACCATTTCACCCTCTTGCTGACAATAAATTTGCACTAACTTGCACTA 
GCACACACTTTGCTTTTGCTTGTGCTGACGGTACTCGACATACCTATCAG 

TCAACAAGAGCTCTACTCGCCACTTTTTCTCATTGTTGCTGCTCTAGTAT 
T^TAATACTTTGCTTCACCATTAAGAGAAAGACAGAATGAATGAGCTCA 
CTTTAATTGACTTCTATTTGTGCTTTTTAGCCTTTCTGCTATTCCTTGTT 
TTAATAATGCTTATTATATTTTGGTTTTCACTCGAAATCCAGGATCTAGA 
AGAACCTTGTACCAAAGTCTAAACGAACATGAAACTTCTCATTGTTTTGA 
CTTGTATTTCTCTATGCAGTTGCATATGCACTGTAGTACAGCGCTGTGCA 
TCTAATAAACCTCATGTGCTTGAAGATCCTTGTAAGGTACAACACTAGGG 
GTAATACTTATAGCACTGCTTGGCTTTGTGCTCTAGGAAAGGTTTTACCT 
TTTCATAGATGGCACACTATGGTTCAAACATGCACACCTAATGTTACTAT 
CAACTGTCAAGATCCAGCTGGTGGTGCGCTTATAGCTAGGTGTTGGTACC 
TTCATGAAGGTCACCAAACTGCTGCATTTAGAGACGTACTTGTTGTTTTA 
AATAAACGAACAAATTAAAATGTCTGATAATGGACCCCAATCAAACCAAC 

gtagtgccccccgcattacatttggtggacccacagattcaactSa^aat 

AACCAGAATGGAGGACGCAATGGGGCAAGGCCAAAACAGCGCCGACCCCA 
AGGTTTACCCAATAATACTGCGTCTTGGTTCACAGCTCTCACTCAGCATC 

gcaaggaggaacttagattccctcgaggccaggkscgttccaatcaIcacc 

AATAGTGGTCCAGATGACCAAATTGGCTACTACCGAAGAGCTACCCGACG 
A ™ TGGTGGTGACGGC ^ TGAAAG AGCTCAGCCCCAG^ 

TCTATTACCTAGGAACTGGCCCAGAAGCTTCACTTCCCTACGGCGCTAAC 
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AAAGAAGGCATCGTATGGGTTGCAACTGAGGGAGCCTTGAATACACCCAA 

AGACCACATTGGCACCCGCAATCCTAATAACAATGCTGCCACCGTGCTAC 

AACTTCCTCAAGGAACAACATTGCCAAAAGGCTTCTACGCAGAGGGAAGC 

AGAGGCGGCAGTCAAGCCTCTTCTCGCTCCTCATCACGTAGTCGCGGTAA 

TTCAAGAAATTCAACTCCTGGCAGCAGTAGGGGAAATTCTCCTGCTCGAA 

TGGCTAGCGGAGGTGGTGAAACTGCCCTCGCGCTATTGCTGCTAGACAGA 

TTGAACCAGCTTGAGAGCAAAGTTTCTGGTAAAGGCCAACAACAACAAGG 

CCAAACTGTCACTAAGAAATCTGCTGCTGAGGCATCTAAAAAGCCTCGCC 

AAAAACGTACTGCCACAAAACAGTACAACGTCACTCAAGCATTTGGGAGA 

CGTGGTCCAGAACAAACCCAAGGAAATTTCGGGGACCAAGACCTAATCAG 

ACAAGGAACTGATTACAAACATTGGCCGCAAATTGCACAATTTGCTCCAA 

GTGCCTCTGCATTCTTTGGAATGTCACGCATTGGCATGGAAGTCACACCT 

TCGGGAACATGGCTGACTTATCATGGAGCCATTAAATTGGATGACAAAGA 

TCCACAATTCAAAGACAACGTCATACTGCTGAACAAGCACATTGACGCAT 

ACAAAACATTCCCACCAACAGAGCCTAAAAAGGACAAAAAGAAAAAGACT 

GATGAAGCTCAGCCTTTGCCGCAGAGACAAAAGAAGCAGCCCACTGTGAC 

TCTTCTTCCTGCGGCTGACATGGATGATTTCTCCAGACAACTTCAAAATT 

CCATGAGTGGAGCTTCTGCTGATTCAACTCAGGCATAAACACTCATGATG 

ACCACACAAGGCAGATGGGCTATGTAAACGTTTTCGCAATTCCGTTTACG 

ATACATAGTCTACTCTTGTGCAGAATGAATTCTCGTAACTAAACAGCACA 

AGTAGGTTTAGTTAACTTTAATCTCACATAGCAATCTTTAATCAATGTGT 

AACATTAGGGAGGACTTGAAAGAGCCACCACATTTTCATCGAGGCCACGC 

GGAGTACGATCGAGGGTACAGTGAATAATGCTAGGGAGAGCTGCCTATAT 

GGAAGAGCCCTAATGTGTAAAATTAATTTTAGTAGTGCTATCCCCATGTG 

ATTTTAATAGCTTCTTAGGAGAATGACAAAAAAAAAAAAAAAAAAAAAAA A 
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229E 

PEDV 

CCov 
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BoCov 
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FGAISSSLQEILSRIiDALEAQAQIDRLINGRLTAIjDAYVSQQLSDSTLVKFS^QAMEW 

FGAISASLQEILSRLDALEAKAQIDRLINGRLTAIjNAOT 

^fASItfEILTM^AVl^^ 

F GAISSVLNDlLSRljDKVEAEVQIDRLITGRLQSLQTYOT 

* * * ? ^* Q ADAQVDRL ITGRL S S LS VLAS AKQ SE Y IRVSQQREL^^KI 

• ! WW. 

w 9 » 

NECVKSQSKRYGFCG-NGTHIFSIViaAAPEGLWLHTvXLPTQYKDVEAWSGLCV-DG-- 
NECVKSQSQRYGFCGGI)GEHIFSLVQAAPQGLLFLHTVLVPGDFvimAIAGLCV-NG-. 
I^IECVRSQSQRFGFCG-NGTHIiFSIjANAAPNGMIFFHTVliLPTAYETVTAWSGICASDGD 
NECVRS Q SQRFGFCG -NGTHLF SLANAAPNGMI FFHTVLLPTAYETVTAWSG I C ALDGDR 

NECVKSQSNRYGFCG-NGTHLFSLV^SAPEGLLFFHTVXLPTEWEEVTAWSGICVNDT-- 
NEC VKSQ S SRINFCG -NGNHI I SLVQNAPYGLYF I HFS YVPTK YVTAKVS PGLC I 

NECVKSQSSRINFCG-NGNHriSLVQNAPYGLYFIHFSYVPTKYVTAKVSPGLCI 

NECVKSQS SRINFCG -NGNHI I SLVQNAPYGLYF IHFS YVPTKYVTAKVSPGLCI 

NECVKSQTTRINFCG-NGNHILSLVQNAPYGLCFIHFSYVPTSFKTANVSPGLCI 

SECVLGQSKRVDFCG-KGYHLMSFPQAAPHGWFLHVTYVPSQERNFTTAPAICH 

NECVKSQ SNRYGFCG - SGRHVL S I PQNAPNG I VF I HFT YTPETFVNVTAI VGFCVNPLNA 

• • • 

TNG YVLRQ PNLAL YK EGNYYRITSRIMFEPRIPTMADFVQIENCNVTFVNISRS 

EIALTLREPGLVLFTHELQTYTATEYFVSSRPvMFEPRKPTVSDFVQI E SCVOTYVNLTSD 

TFGLWKDVQLTLFRN LDDKF YLTPRTMYQ P IVAT S SDFVQI EGCDVLFVNATVI 

TFGLWKDVQLTLFRN LDDKFYLTPRTMYQ PRVATS SDFVQ I EGCDVLFVNTTVS 

-YAYVLKDFDHSIFS YNGTYMVTPRNMFQPRKPQMSDFVQITSCEVTFLNMTYT 

-AGDRGIAPKSGYFVN VimWTGSGYYYPEPITGNNVVvldSTCAVNYTKAPDV 

-AGDRGIAPKSGYFVN VTtfNTWMFTGSRYYYPEP ITGNNVVVMS TCAVNYTKAPDV 

- AGDIG ISPKSG YFIN VNNS WMFTG S SYYYPE P I TQNNWVMS TC A VNYTKAPDL 

- SGDRGLAPKAGYFVQ DNGEWKFTGSNYYYPEPITDKNSVAMI SCAVNYTKAPEV 

- EGKAYF PREGVFVFN GTSWFITQRNFFSPQI ITTDNTFVSGNCDWIGIINNT 

SQYAIVPANGRGIFIQ VNGTYYI TSRDMYMPRDITAGDI VTLTSC QANYVNVNKT 

' ■ i : * : . * . 

ELQTIVP - EYI DVNKTLQELS YKL - PNYTVPDLV VEQYNQTI LNLTS E I STLENK S A 

QLPDVIP-DYIDVmTLDEXrJ\SL-PNRTGPSLP---LDVFNATYLNLTGEIADLEQRSE 

DLPSIIP-DYIDINQTVQDILENFRPNWTVPELP LDIFNATYLNLTGEINDLEFRSE 

DLP S IIP- DYIDINQTVQDI LENFRPNWTVP ELT LDVFNATYLNLTGE I DDLEFRSE 

TFQElVI-DYIDIimTIADMLEQYNPNYTTPELNL-LLDIFNQTKLNLTAEIDQLEQRAD 

MLNI STP -NLHDFKEELDQWFKNQ — TSVAPDLSL- DY — INVTFLDLQDEMN - 

MLNISTP-NLPDFKEELDQWFKNQ — TLVAPDLSL-DY- — INVTFLDLQDEMN 

MLNTSTP-NLPDFKEELYQWFKNQ — SSVAPDLSL-DY — INVTFLDLQDEMN 

FLNNS IP - NLPDFKEELDKWFKNQ — TS IAPDL SL - DFEKLNVTFLDL TYEMN 

VYDPLQ P - ELDSFKEELDKYFKNH TSPDVDLGDI SGINASWNIQKE ID 

VITTFVEDDDFNFDDELSKWWNDT- -KHGLPDFD- — DFNYTVPILNISGEID 



229E 

PEDV 

CCov 

PRC 

FICV 

BoCov 

OC43 

PHEV 

MHV 

TOR2_S 

AIBV 



ELNYTVQKLQTLI DN INSTLVDLKWLNR VT3TYIKWPWWVWLC I SWL I FWSMLLLCCC S 
SLRNTTEELRSLINNINNTLVIDLEWLNRV^TYIKWPWWW 

KLHNTTVELA IL IDNINNTL VNLEWLNR I ET YVKWPWYVWLL IGL WI FCI PILLFCCC S 
KLHNTTvTSLAlLIDNIlWTLVT^^ 

NLTTIAHELQQYIDNIiNKTLTOLDWLNRIETYVlGVPWYVWL 

RI^PAIKVLNQSYINLKDIGTYEYYVICWPWYVWLLIGFAGVAMLVLLFFICCC 

RLQEAIKVXNQSYINLKDIGTYEYYVlCWPVm^LIGFAGVAMLVLLFFICCC 

-RI^EAIKVTiNQSYINLKDIGTYEYYVKWPWYVW^ 

RIQDAI KKLNE S YINLKEVGTYEMYVlCWPWYvl^L IGLAGVAVC VLLFF ICCC 

RIjNEVAKNLNESL IDLQELGKYEQ YIKWPWYVWLGFIAGLIA I VMVTI LLCCM 

NlQGVIQ^LNDSLINLEELSIIKTYIKWPVmmLAIGFAIIlFILILGWVFFM 

• ■ . :*.: ::*. • : *.****.*** . 
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22 9E TGCCG-FFSCFASSIRGCCESTKD-PYYDVEKIHIQ 

PEDV TGCCG-CCGCCGACFSGCCRGPRIiQPYEAFEKVHVQ 

CCov TGCCG-CIGCLGSCCHSICSRRQFESYEPIEKVHVH 

PRC TGCCG-CIGCLGSCCHSIFSRRQFENYEPIEKVHVH 

FICV TGFCG-CFGCVGSCCHSLCSRRQFETYEPIEKVHIH 

BoCov TGCGTSCFKICGGCCD-DYTGHQELVTK— -TSHDD 

OC43 TGCGTSCFKKCGGCCD-DYTGHQELVIK TSHEG 

PHEV TGCGTSCFKKCGGCCD- DYTGHQEFVIK TSHDD 

MHV TGCGSCCFRKCGSCCD- EYGGHQDS IVIHNI SAHED 

T?™- S TSCCSCLKGACSCGSCCKFDEDDSEPVLKGVKLHYT 

TGCCGCCCGCFG 1 1 PL I SKCGKKS S YYTTFDND WTEQ YRPKKS V 



AIBV > 



Key Name 
229E 



spike glycoprotein [Human coronavirus 229E1 . L « , 

AIBV spike glycoprotein [Avian infectious bronchitis virus! ^S^os^'*! SEQ ID N0: 53 > 

BoCov E2 glycoprotein precursor (Spike glycoprotein^ ' ^il** IVll (SEQ ID NO: 54) 

<™» spike protein - canine coronavirus P ^ 5193 30 - 5% <SEQ ID N0: 55 > 

peplomer protein [Feline infectious peritonitis virus] p !f L 11 '11 ! SEQ ID N ° : 

SJS?=2S; ~~ ^ .lycoproteinr ^ ' ™5£** »; £ « E? 2. 



CCoV 
FICV 
MHV 
OC43 
PEDV 



56) 
57) 



surface protein Z hu^ cor^Irus P ' 2"?? 31 - 9% <SE « ID 58) 

spike protein [Porcine epidemic diarrhea virusl Itili^ 30 - 7% <SEQ ID NO: 591 

PHEV spike glycoprotein tDorcinBh»m»^T,.;V^ t • ' ' CAA80971 26.0% (SEQ ID NO: 60) 

PRC S protein [ P orc?ne ret^ra^corona^rus? 9 enCephal °'^ elit " virus, AAL80031 30.5% (SEQ ID NO: 61, 

TOR2.S Sars associated virus S glycoprotein (SBQXDNO: 33, AAA46905 27.5% (SEQ ID NO: 62) 
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10 20 30 40 50 

TOR2_E MYS FVS EETGTL ^NSVLLFIiAFVVFLLVTLAIIjTALRIiC AYCCNIVNVSLVKPTV 

_ ■ » • . 

* * * • • • • * » » a ■«•« * * • * • • • • 

PGV MTFPRALTVTDDNG-MVINI IFWFLLII ILILLS I ALLNI IKI/CMVCCNMRTVI IVPAQ 

10 20 30 40 50 

60 70 

TOR2_E YVYSRVKNLNS S EGVPDLLV (SEQ ID NO: 35) 

< » » 
• • • • • • • 

PGV HAYDAYKNFMRIKAYNPDGALLA (SEQ ID NO: 63) 
60 70 80 
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ME S L VLGVNEKTHVQ I» S L PVLQ VRDVLVRG FGDS VEEAL S E AREHLKNGT 

CGLVELEKGVLPQIiEQPYVFIKRSDALSTNHGHKWELVAEMDGIQYGRS 
G I TLGVL VPHVG ET P I A YRNVLLRKNGNKG AGGH S YG IDLKSYDLGDELG 
TDP I ED YEQNWNTKHG SG ALRELTRELNGG AVTR YVDNNF CG P DG Y P LDC 
IKDFLtARAGKSMCTLSEQLDYI ESKRGVY CCRDHEHE I AWFTERSDKS YE 
HQTPFEIKSAKKFDTFKGECPKFVFPIiNSKVKVIQPRVEKKKTEGFMGRI 
RSVYFVAS PQECNNMHL STLMKCNHCDEVSWQTCDF LKATCEHCGTENLV 
I EGPTTCGYL PTNAWKMPC PAC QDPE IGPEHSVADYHNHSNI ETRLRKG 
GRTRCFGGCVFAYVGC YNKRAYWVPRAS ADIG SGHTG ITGDNVETTiNEDL 

LEILSRERVNINTVGDFHIiNEEVAIIIiASFSASTSAFIDTIKSLDYKSFK 
TIVESCGNYKVTKGKPVKGAWNIGQQRSVLTPLCGF P S QAAGVIRS I FAR 

TLDAANHSIPDLQRAAVTILDGISEQSLRLVDAMVYTSDLLTNSVIIMAY 
VTGGLVQQTS QWLSNLLGTTVEKLRP I FEWI EAKLS AGVEFLKDAWE I LK 
F L ITGVFDI VKGQIQVAS DNIKDCVKC F IDWNKALEMC IDQ VTI AGAKL 
RSLNLGEVF IAQSKGLYRQC IRGKEQLQLIiMPIiKAPKEVTFLEGDSHDTV 
LTSEEVVLKNGELEALETPVDSFTNGAIVGTPVCVNGIiMLLEIKDKEQYC 
ALS PGLLATNNVFRLKGGAP I KGVTFGEDTVWEVQG YKNVRITFELDERV 
DKVLNEKC SVYTVESGTEVTEFAC WAEAWKTLQ PVSDLLTNMG I DLDE 
WSVATFYIiFDDAGEENFS SRMYC SFYPPDEEEEDDAECEEEEIDETCEHE 
YGTEDDYQGLPLEFGASAETVRVEEEEEEDWLDDTTEQSEIEPEPEPTPE 
E PVNQFTG YLKLTDNVAI KC VD I VKEAQ S ANPMVT VNAANIHLKHGGGVA 
G ALNKATNGAMQKE SDDYI KLNGPLTVGG SCLL SGHNLAKKC LHWGPNL 
NAGEDIQLLKAAYENFNSQDILLAPLLSAGIFGAKPLQSLQVCVQTVRTQ 
VYIAVNDKALYEQWMDYLDNLKPRVEAPKQEEPPNTEDSKTEEKSWQK 
PVDVKPKIKAC IDEVTTTLEETKFLTNKLLLFADINGKLYHDSQNMLRGE 
DMS F LEKDAPYMVGDVITSGDI TCVVI P SKKAGGTTEMI»SRALKKVPVDE 
YITTYPGQGCAGYTLEEAKTALKKCKSAFYVLPSEAPNAKEEILGTVSWN 
IiREMLAHAEETRKLMPICMDVRAIMATIQRKYKGIKIQEGIVDYGVRFFF 
YTSKE P VAS I ITKLNSLNE PLVTMP IGYVTHGFNLEEAARCMRSLKAPAV 
VSVSSPDAVTTYNGYLTSSSKTSEEHFVETVSLAGSYRDWSYSGQRTEIiG 
VEFLKRGDKIVYHTI/E S PVE FHLDGEVL SLDKLKSLL S LREVKTIKVFTT 
VDNTNLHTQL VDMS MTYGQ Q FG PT YLDG AD VTK I KP HVNHEG KTF F VL> P S 
DDTLRSEAFEYYHTLDESFLGRYMSALNHTKKWKFPQVGGIiTSIKWADNN 
CYLSSVLLALQQLEVKFNAPALQEAYYRARAGDAANFCALILAYSNKTVG 
ELGDVRETMTHLLQHANLES AKRVLNVVC KHCGQKTTTLTGVE AVMYMGT 
LS YDNLKTGVS I PC VCGRDATQYLVQQ ESS FVMMS AP PAEYKLQ QGTFLC 

ANEYTGNYQCGHYTHITAKETLYRIDGAHLTKMSEYKGPVTDVFYKETSY 
TTTIKPV S YKIiDGVTYTE I EPKLDG YYKKDNAYYTEQ P I DLVPTQPL PNA 
S FDNFKLTC SNTKFADDLNQMTGFTKPASREL SVTFFPDLNGDWAIDYR 
HYSASFKKGAKLLHKPIVWHINQATTKTTFKPNTWCLRCLWSTKPVDTSN 
SFEVIiAVEDTQGMDNLACESQQPTSEEVVENPTIQKEVIECDVKTTEVVG 
NVILKPSDEGVKVTQEIjGHEDIjMAAYVENTSITIKKPNELSLAIjGrjKTIA 
THGIAAINSVPWSKILAYVKPFLGQAAITTSNCAKRLAQRVFNNYMPYVF 
TLIiFQLCTFTKSTNSRIRASLPTTI AKNSVKSVAKLCLDAG INYVKS PKF 
SKLFTIAMWIiLLLSICLGSLICVTAAFGVLLSNFGAPSYCNGVRELYLNS 
SNVTTMDFCEGSFPC S ICL SGLDS LDS YP ALETI QVTI S S YKLDLTI LGL 
AAEWVLAYMLFTKFFYLLGLSAIMQVFFGYFASHFI SNSWLMWFI I S I VQ 
MAPVSAMVRMYIFFASFYYIWKSYVHIMDGCTSSTCMMCYKRNRATRVEC 
TTIVNGMKRSFYVYANGGRGFCKTHNWNC LNCDTFCTG STF I SDE VARDL 

SLQFKRPINPTDQSSYIVDSVAVKNGALHLYFDKAGQKTYERHPLSHFVN 
LDNLRANNTKG SL P INVI VFDGKS KCDE S ASKSAS VYYSQLMCQ P ILLLD 

QALVSDVGDSTEVSVKMFDAYVDTFSATFSVPMEKLKALVATAHSELAKG 
VALDGVLSTFVSAARQGVVDTDVDTKDVIECLKLSHHSDIiEVTGDSCNNF 
MLTYNKVENMTPRDLGAC IDCNARH inaq VAKSHNVS liwnvkdymsls e 

QLRKQIRSAAKKNNIPFRLTCATTRQVVI^ITTKISLKGGKIVSTCFKIjM 

lkatllcvlaalvcyivmpvhtlsihdgytneiigykaiqdgvtrdiist 
ddcfankhagfdawfsqrggsykndkscpwaaiitreigfivpglpgtv 
lraingdflhflprvfsavgnicytpsklieysdfatsacvlaaectifk 
damgkpvpycydt^legsisyselrpdtryvlmdgsiiqfpntylegsv 
rvvttfdaeycrhgtcersevgiclstsgrwvlnnehyralsgvfcgvda 
mnl iani ftplvqpvgaldvs aswagg 1 1 a ilvtc aa yyfmkfrrvfge 
ynhwaanallflmsftilclvp ays fl pgvysvfyl yltf yftndvs fl 

AHLQV^AMFSPIVPFWITAIYVFCISLKHCHWFFNNYXiRKRVMFNGVTFS 
TFEEAALCTFLLNKEMYLKLRSETLLPLTQYl^YliALYNKYKYFSGA^ 
TSYREAACCHLAKALNDFSNSGADVLYQPPQTSITSAVLQSGFRKMAFPS 
GKVEGCMVQVTCGTTTLNGLWLDDTVYC PRHVICTAEDMLNPNYEDLL I R 

KSNHSFLVQAGNVQLRVIGHSMQNCLLRLKVDTSNPKTPKYKFVRIQPGQ 
TF S VI*AC YNG S PSGVYQCAMRPNHTI KG S FLNG SCG SVGFN I DYDCVS FC 

YMHHMELPTGVHAGTDLEGKFYGPFVDRQTAQAAGTDTTITLl«nnjAWLYA 
AVINGDRV^L^FTTTI^FNIiVAMKYNYEPLTQDHVDIU3PLSAQTGIA 
VLDMCAALKELLQNGMNGRTILGSTILEDEFTPFDWRQCSGVTFQGKFK 
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KIWGTHHWMLLTFLTSLIilLVQSTQWSLFFFVYENAPLPFTLGIMAXAA 
C AMLL VKHKHAFLCL FLL P S LA WAYFTtfMVYMP AS WVMRJ MTWL ELADT S 
LSGYRLKDCVMYASALVLLILOT 

GNAIiDQAlSMWALVISVTSNYSGVVTTIMFLARAIVFVCVEYYPLLFITG 
NTLQC IMLVYCFLGYCCCC YFGLFC LLNRYFRLTLGVYDYLVSTQEFR YM 

NSQGLLPPKSSIDAFKLNIKLLGIGGKPCIKVATVQSKMSDVKCTSWLL 
S VLQQLRVES S S KIjWAQCVQLHNDILIAKDTTEAFEKMVS LLS VTjLSMQG 

AVDINRLCEEMLDNRATLQAIASEFSSLPSYAAYATAQEAYEQAVANGDS 
EVVLKKLKKSIiNVAKS EFDRDAAMQRKLEKMADQAMTQMYKQARS EDKRA 
KVTS AMQTMLFTMLRKLDNDALNNI INNARDGCVPLNI I PLTTAAKLMW 
VPD YGTYKNTCDGNTFTYAS ALWE I QQWDADSKI VQLS E INMDNS PNLA 
WPLIVTALRANSAVKLQNMELS P VALRQM S C AAG TTQ TAC TDDNAL A YYN 
NSKGGRFVIALLSDHQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGP 
KVKYLYFIKGLNNLNRGMVIX5SriAATVRLQAGNATEVPANSTVLSFCAFA 
VDPAKAYKDYLASGGQPITNCVKMLCTHTGTGQAITVTPEANMDQESFGG 

ASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTTCANDPVGFTLRNTVCTVC 
GMWKGYGCSCDQIiRE PLMQSADASTF 

(SEQ ID NO: 64) 
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FKRVCG 

VSAARLTPCGTGTSTDWYRAFDIYNEKVAGFAKFLKTNCCRFQEKDEEG 
^^SYFWKRH1MSNYQHEETIYNLVKDCPAVAVHDFFKFRVIX3DMVPH 
ISRQRLTKYTMADLVYALRHFDEXSNCOTLK^ILVTYNCCDDDYFNKKDWY 
DFVENPDI LRWANIX3ERVRQ SI*LKTVQFCDAMRDAG I VGVLTLDNQDLN 
GNWYDFGDFVQ VAPGC GVP I VD S YYS LLMP I LTLTRALAAE SHMD ADLAK 

plikwdllkydfteeru:lfdryfkywd^^ 

nvlfstvfpptsfgplvrkifvdgvpfvvstgyhfrelgvvhnqdvnlhs 
srlsfkellvyaadpamhaasgnllldkrttcfsvaaltnnvafqtvkpg 
nfnkdf ydfavskgffkeg s svelkhfffaqdgnaai sd yd yyrynlptm 

CDIRQLLFVVEVVDKYFDCYDGGCIl^QVIVNNLDKSAGFPFNKWGKAR 
LYYDSMSYEDQDAIjFAYTKRNVI PTITQMNItKYAI SAKNRARTVAGVSIC 

STMTNRQFHQKLLKSIAATRGATWIGTSKFYGGWHNMIiKTVYSDVETPH 
LMGWDYPKCDRAMPNMIiRIMAS LVLARKHNTCCNL S HRFYRLANECAQ VL 

SEMVMCGGSLYVKPGGTSSGDATTAYANSVFNICQAVTANVNALLSTDGN 
KI ADKYVRNLQHRLYECL YRNRDVDHEFVDEFYAYLRKHF SMMILS DD AV 

VCYNSNYAAQGLVASIKNFKAVLYYQNNVFMSEAKCWTETDLTKGPHEFC 
SQHTMLVKQGDDYVYLPYPDPSRILGAGCFVDDIVKTDGTLMIERFVSLA 
IDAYPLTKHPNQEYADVFHLYLQYIRKLHDELTGHMLDMYSVMLTNDNTS 
RYWEPEFYEAMYTPHTVLQAVGACVIjCNSQTSLRCGACIRRPFLCCKCCY 
DHVI STSHKLVLSVNPYVCNAPGCDVTDVTQL YLGGMSYYCKSHKPPI SF 
PIiCANGQWGIiYKNTCVGSDNVTDFNAIATCDWTNAGDYILANTCTERLK 
LFAAETLKATEETFKLSYGIATVREVIjSDRELHLSWEVGKPRPPLNRNYV 

ftgyrvtknskvqigeytfekgdygdawyrgtttyklnvgdyfvltsht 
vmplsaptlvpqehyvritglyptlnisdefssnvanyqkvgmqkystlq 

G PPGTGKS HFAI GLALYYP S ARXVYTAC SHAAVDALCEKALKYLP IDKC S 
RIIPARARVECFDKFKVNSTLEQYWCTVNALPETTADIVVFDEISMATN 
YDLSVVNARLRAKHYVYIGDPAQLPAPRTLLTKGTLEPEYFNSVCRIiMKT 
IGPDMFLGTCRRC PAE I VDTV SALVYDNKLKAHKDKS AQCFKMF YKGVIT 
HDVSSAINRPQ IGWREFLTRNPAWRKAVFI SPYNSQNAVASKIIiGLPTQ 
TVDS SQGSEYDYVIFTQTTETAHSCNVNRFNVAITRAKIGILC IMSDRDL 
YDKLQFTSLEIPRRWATLQAENVTGLFKDCSKIITGLHPTQAPTHIiSVD 
IKFKTEGLCVD I PG I PKDMTYRRL I SMMGFKMNYQVNGYPNMF ITREEAI 
RHVRAW IGF DVEG C HATRD AVG TNL PL QLGF S TG VNLVAVPTG YVDTENN 
TEFTRVNAKPPPGDQFKHLIPIiMYKGtjPWNVVRIKIVQMljSDTLKGIiSDR 
WFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWNHS 
VGFDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCL 
AVHECFVKRVDWSVEYPI IGDELRVNSACRKVQHMWKSALLADKFPVLH 
DIGNPKAIKCVPQAEVEWKFYDAQPCSDKAYKIEELFYSYATHHDKFTDG 
VCLFT^CNVDRYPANAI VCRFDTRVLSNLNLPGC DGG SLYVl^KHAFHTPA 
FDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDIDYVPLKSATCITRCNLG 
GAVCRHHANEYRQYLDAYNMMI S AGF S LWI YKQF DTYNLWNT FTRLQ SLE 
NVAYNWNKGHFDGHAGEAPVS I INNAVYTKVDG I DVE I FENKTTLP VNV 
AFELWAKRNIKPVPEIKILNNLGVDIAANTOIWDYKREAPAHVSTIGVCT 
MTDI AKKPTE SAC S S LTVLFDGRVEGQVDLFRNARNGVL I TEG SVKGLTP 
SKGPAQASVNGVTLIGE SVKTQFNYFKKVDG I IQ QLPETYFTQ SRDLEDF 

KPRSQMETDFLELAMDEFIQRYKLEGYAFEHIVYGDFSHGQLGGLHLMIG 
LAKRSQDS PLKLEDF I PMDS TVKNYFITDAQTG S S KC VC S VIDLLLDDFV 
E I IKSQDLS VI SKWKVT IDYAE I S FMLWCKDGHVETFYPKLQAS QAWQ P 
GVAMPNL YKMQRMLLEKCDLQNYGENAVI PKG IMMNVAKYTQLCQ YLNTL 
TLAVPYNMRVI HFGAGSDKGVAPGTAVLRQWL PTGTLLVDSDLNDFVSDA 
DSTLIGIXATVHTANKWDLIISDMYDPRTKHVTKENDSKEGFFTYXiCGFI 
KQKLALGGSIAVKITEHSWNADLYKLMGHFSWWTAFVTNVNASSSE 
GANYLGKPKEQIDGYTMHA^IFWRNTNPIQLSSYSLFDMSKFPLKIiRGT 
AVMSLKENQ INDMI YS LLEKGRL 1 1 RENNRVWS SD ILVNN 

(SEQ ID NO: 65) 
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MDLFMRF FTLRS I TAQPVK I DNAS PASTVHATAT I PLQ ASLPFGWLVI GV 

AFLAVFQSATKIIALNKRWQLALYKGFQFICNLLLLFVTIYSHLLLVAAG 
MEAQFLYLYALIYFLQCINACRIIMRCWLCWKCKSKNPLLYDANYFVCWH 
THNYDYC I P YNS VTDT I WTEGDG I S TPKLKED YQ I GGYSEDRHS GVKDY 

VWHGYFTEVYYQLESTQITTDTGIENATFFIFNKLVKDPPNVQIHTIDG 
SSGVANPAMDPIYDEPTTTTSVPL (SEQ ID NO: 66) 



FIGURE 18 



MMPTTLFAGTHITMTTVYHITVSQIQLSLLKVTAFQHQNSKKTTKLWIL 

RIGTQVLKTMSLYMAISPKFTTSLSLHKLLQTLVLKMLHSSSLTSLLKTH 

RMCKYTQSTALQELLIQQWIQFMMSRRRLLACLCKHKKVSTNLCTHSFRK 
KQVR (SEQ ID NO: 67) 

FIGURE 19 



MFHLVDFQVT I AE IL I I IMRTFRI AIWNLDVI I SS I VRQLFKPLTKKNYS 
ELDDEEPMELDYP (SEQ ID NO: 68) 

FIGURE 20 



MKIILFLTLIVFTSCELYHYQECVRGTTVLLKEPCPSGTYEGNSPFHPLA 

DNKFALTCTSTHFAFACADGTRHTYQLRARSVSPKLFIRQEEVQQELYSP 
LFLIVAALVFLILCFTIKRKTE (SEQ ID NO: 69) 

FIGURE 21 



MNELTLIDFYLCFLAFLLFLVLIMLIIFWFSLEIQDLEEPCTKV 

(SEQ ID NO: 70) 

FIGURE 22 



MKLLI VLTC I SLC SC I CTWQRCASNKPHVLEDPCKVQH 
(SEQ ID NO: 71) 

FIGURE 23 



MCLKILVRYNTRGNTYSTAWLCALGKVLPFHRWHTMVQTCTPNVTINCQD 
PAGGAL I ARCWYLHEGHQTAAFRDVL WLNKRTN (SEQ ID NO: 72) 
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FIGURE 24 



MDPNQTNWPPALHLVDPQIQLTITRMEDAMGQGQNSADPKVYPIILRLG 
SQLSLSMARRNLDSLEARAFQSTPIWQMTKLATTEELPDEFVWTAK 

(SEQ ID NO: 73) 

FIGURE 25 



MLPPCYNFLKEQHCQKASTQREAEAAVKPLLAPHHWAVIQEIQLLAAVG 
EILLLEWLAEWKLPSRYCC (SEQ ID NO: 74) 



FIGURE 26 

CIAVGQLCVFWNIGRPCCSGLCVFA— CTVKL conotoxin 

CISLCS-CICTWQRCASNKPHVLEDPCKVQH sars 
**-- *• •* *. **. 



FIGURE 27 



